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Sexism has no place in science 


The comments about women in the laboratory made by Nobel laureate Tim Hunt are a reminder 


that equality in science is a battle still far from won. 


Korea, at which the British Nobel prizewinner Tim Hunt 

expressed jaw-dropping and belittling sentiments about women 
in the laboratory, is focusing minds, once again, on how to make the 
most of that half of the human population in research. 

All involved in science should condemn the comments, which 
suggested that single-sex labs might be preferable because “girls” tend 
to fall in love with their male colleagues and cry when criticized. 

Hunt — who won the 2001 Nobel Prize in Physiology or Medicine 
for his work on cell division — said that his remarks were intended to be 
light-hearted. Following a storm on Twitter and much media coverage, 
he was asked to step down from various influential roles. 

Some public and media analysis of the sorry story has now shifted 
to argument about whether the punishment fits the crime. In an inter- 
view with the Observer newspaper last weekend, Hunt complained that 
he had been “hung out to dry”. 

Whether or not the reaction has been fair, his comments and 
attitudes have become shorthand for the dismaying extent to which 
sexism still pervades science, and serve as a prompt to discuss the 
problem. 

The problem is worth, once again, stating clearly. In the United 
States and Europe, around half of those who gain doctoral degrees 
in science and engineering are women — but barely one-fifth of full 
professors are women. Women are not invited in significant numbers 
to sit on the scientific advisory boards of start-up companies. A sci- 
entific conference at which half of the keynote speakers are women 
stands out simply because of that. 

The challenge and the consequences of the lack of women in 
research — especially at senior levels — are expressed in many 
ways, with worrying persistence. Evidence suggests that too many 
women encounter patronizing attitudes or harassment in research 
contexts — whether at work, at academic conferences or in the field. 
Recent evidence, such as studies of mock hiring exercises, and analy- 
sis of grant success rates, authorship assignment and citation counts, 
suggests that discrimination against women runs deep in the psyches 
of both genders. 

The problem is serious and long-standing. But there are plenty of 
ways to tackle it. Nature has discussed and promoted them before, and 
is happy to do so again. Here is a list of measures to consider afresh: 


| ast week’s incident at a meeting of science journalists in South 


@ Recognize and address unconscious bias. Graduate students given 
grants by the US National Institutes of Health are required to undergo 
ethics training. Gender-bias training for scientists, for example, would 
be a powerful way to help turn the tide. 

e Encourage universities and research institutions to extend the 
deadlines for tenure or project completion for scientists (women and 
men) who take parental leave, and do not penalize these researchers by 
excluding them from annual salary rises. Many workplaces are happy 


to consider and agree to such extension requests when they are made. 
The policy should simply be adopted across the board. 

e Events organizers and others must invite female scientists to lecture, 
review, talk and write articles. And if the woman asked says no — for 
whatever reason — then ask others. This is about more than mere 
visibility. It can boost female participation too. Anecdotal reports sug- 
gest that women are more likely to ask questions in sessions chaired 
by women. After acknowledging our own bias 


“Too many towards male contributors, Nature, for example, 
women is engaged in a continued effort to commission 
encounter more women in our pages. 

patronizing @ Do not use vocabulary and imagery that 
attitudes or support one gender more than another. Words 
harassment matter. It is not ‘political-correctness-gone- 
in research mad’ to avoid defaulting to the pronouns ‘him’ 


contexts.” and ‘he; or to ensure that photographs and 
illustrations feature women. 

e@ In communication and promotional materials, highlight women 
who have made key contributions to previous work, whether in 
your own lab or within your research discipline more broadly. 

@ Be aware of the importance of informal settings and social activities 
to workplace culture, and people’ sense of their place within it. Senior 


scientists can, where possible, make such events inclusive. 


The lot of the female scientist in most developed countries is better 
than it was a few decades ago — a time that forged the thinking and 
attitudes of many of today’s senior scientists. But such attitudes con- 
tinue to prevent equality. It is right to highlight and protest against 
examples of explicit and implicit bias — of all types — in research. 
And it is essential that all involved strive for better. m 


The right climate 


A Republican US presidential candidate speaks 
on climate change, but will his party listen? 


contender for the Republican presidential nomination earlier 

this month. He is a staunch conservative who tends to vote with 
his party on everything from gun control to health care and foreign 
policy. He is also the first Republican candidate to squarely address 
the question of climate change — in a constructive way. During an 
interview on the CNN news channel on 7 June, Graham highlighted 
the problem and issued a welcome challenge to his fellow Republicans. 


U; Senator Lindsey Graham of South Carolina became the ninth 
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“Here’s a question you need to ask everybody running as a 
Republican: what is the environmental policy of the Republican Party?” 
Graham said. “When I ask that question, I get a blank stare” 

Graham could not be more correct. It has been clear for some time 
that climate change is a defining social, and therefore political, issue 
for the twenty-first century. Questions remain about what kind of 
impacts to expect and how best to reduce greenhouse-gas emissions, 
while extending the benefits of modern industry to the world’s poorest 
citizens. But the core science is solid, and policy-makers at all levels 
have a responsibility to engage with it. Sadly, the Republican Party's 
strategy in Congress thus far has been to ignore or dodge the problem, 
or to deny it outright. 

Fellow candidates such as senators Marco Rubio of Florida and 
Ted Cruz of Texas have both questioned the science of global warming 
or humanity’s contribution to it, as has former Pennsylvania senator 
Rick Santorum. When the US Senate took up a pair of symbolic resolu- 
tions in January, just 15 Republicans voted in favour of an amendment 
declaring that human activity contributes to climate change, and only 
5 were willing to support a resolution stating that the human contribu- 
tion is significant. 

So far, the only significant Republican engagement on climate has 
come in the form of opposition to regulations being imposed by the 
administration of President Barack Obama. Those regulations are in 
place for cars and trucks, and the administration is expected to finalize 
a rule to limit carbon pollution by the electricity sector as early as next 
month. Just last week, the Environmental Protection Agency released 
a draft “endangerment” finding that will enable the agency to regulate 
emissions from aeroplanes, in accordance with standards being negoti- 
ated through the International Civil Aviation Organization. 

Many on the right — including Senator Graham — have argued 
against these regulatory efforts, and there are legitimate reasons for doing 


so. Most experts agree that it would be much more effective to tackle the 
issue ina comprehensive way, whether through a market-based regula- 
tory emissions scheme like that adopted in the European Union, or a 
simple carbon tax. An insurgent environmental faction in the Republican 
Party is pushing for the latter, perhaps paired with a dividend system that 
would make the tax revenue-neutral and offset the higher prices paid 
by consumers for petrol and for fossil-fuel-based electricity, as a sim- 

pler conservative solution. But before they can 


“The rest of even think about laying outa proactive agenda, 
the world has Republicans must acknowledge the problem. 
moved beyond In this sense, Senator Graham’s challenge 
questions about __ isa step forward. Not only is it in line with 
whether climate mainstream science, but it also chimes 


with mainstream views within the US public. 
A March survey by the Yale Project on Cli- 
mate Change Communication found that 63% of Americans believe 
that global warming is happening and 52% think that it is mostly caused 
by humans; just 18% think that it is not happening, with 32% believ- 
ing that it is mostly due to natural environmental factors. Also, 71% of 
Americans “mostly” or “somewhat” trust climate scientists, compared 
with 27% who mostly or somewhat distrust them. Even the corporate 
community that Republicans claim to represent is beginning to engage 
with climate change. And Pope Francis has issued his own call to action 
on climate change this week (see go.nature.com/19lurz). 

The rest of the world has moved beyond questions about whether 
climate change is real and is focused on how best to address it. There 
is plenty of pressure on Republicans to do the same, but the challenge 
to party orthodoxy ultimately needs to come from within. Senator 
Graham has put the problem on the Republican presidential agenda. 
To build momentum, he and other brave souls will now need to start 
talking about solutions. m 


change is real.” 


Risky business 


Funding agencies should highlight their roles 
as risk managers to underpin public trust. 


good governance includes regular evaluation of risks — risks 

to employees, to security, to processes, to reputation, to finan- 
cial viability — and, where these risks are considered excessive, actions 
to mitigate them. Risk assessments of this type need to be specific, so 
inevitably they are confidential. 

Research funders who steward the allocation of millions or billions 
of dollars routinely assess and manage such risks. But they also deal 
with other kinds of risk — and success in those areas should not be 
kept quiet. 

There are two types of risk management that funders should high- 
light: one that deals with bad risk, and the other with good. 

The bad risks involve funds being wasted on inept, sloppy or 
downright fraudulent research. Funding agencies are usually 
responsible to taxpayers; it is therefore crucial that they are publicly 
accountable for ensuring that they support robust science. This is 
easily said. How can it be done? The least that the funder can do 
is to ensure that the institutions that it funds have procedures for 
tackling sloppy or dishonest research when it arises and, better still, 
for reducing the likelihood of such problems through appropriate 
scrutiny and training. 

Journals tackling the challenges of reproducibility have found that 
measures to ensure that research is robust can be onerous for editors, 
referees and authors. The same will be true for funders. But if public 
trust in science is to be maintained, funders need to mitigate bad risks 


A nyone who has helped to run an organization knows that 
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through assessment procedures. They could, for example, ensure that 
claims about the statistical power of proposed experiments stand up. 
What is more, they need to be seen to be doing this. 

One funder that has taken these responsibilities seriously is the 
US National Institutes of Health. To tackle poor reproducibility in 
biomedical research, it has arranged discussion meetings, intro- 
duced training programmes and made its assessment of grant 
applications more rigorous. It has also made these initiatives 
highly visible: see go.nature.com/8jhvwgq. Others should follow its 
example. 

Then there are the good risks. Ata meeting of the Global Research 
Council in Tokyo last month, funders produced a declaration of prin- 
ciples to address the challenges of funding scientific breakthroughs 
(see go.nature.com/nrlxmd). The most consistent point was that 
funders should feel able to take positive risks — that is, to support 
daring ideas that have a chance of failure. They should “encourage 
risk-taking and tolerate failure in research activities” And they should 
“provide researchers with the flexibility and intellectual space needed 
for serendipity”. Much of this comes down to funders ensuring “the 
freedom of researchers in defining their topics, their methodologies 
and their resource allocations”. 

Although such boldness is not easy to implement, researchers know 
what the combination of great talent and bold ideas can deliver. Out- 
siders, including governments and taxpayers, might not appreciate the 
potential value of positive risk-taking. But it goes with the territory of 
supporting outstanding science. 

It is important, therefore, that funding-agency managers empower 
selection committees to accept a degree of risk in supporting boldness 
in research, and to back researchers up when 
(and it will be when, not if) some of those risks 
do not pay off. Any funding agency that lives 
up to these principles should proclaim its risk- 
taking mission with pride. m 
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R L D V EW A personal take on events 


modified rules to regulate research with human participants. At 

the heart of the issue — and one likely reason why the discussions 
are taking so long — is the question of risk, and how to assess it. In fact, 
the goal of the new rules is to more effectively link how risky a project 
is perceived to be for the people taking part with how it should be regu- 
lated. Hence the name: risk-based regulation. 

How to assess risk in research with humans has been a persistent 
problem. These decisions are typically taken by institutional review 
boards (IRBs), but studies have shown that different boards, and indeed 
different members of the same IRBs, judge the risk of identical proce- 
dures in very different ways. 

This variation in risk assessment contributes to multiple problems 
as reflected in the title of the 2011 US advanced 
notice of proposed rule-making: Enhancing Pro- 
tections for Research Subjects and Reducing Bur- 
den, Delay, and Ambiguity for Investigators (see 
go.nature.com/x8rfth). 

The current US regulatory framework known 
as the common rule calls for the evaluation of 
‘probability and magnitude of harm or discom- 
fort. This language implies that IRBs can make 
objective assessments of risk and wrongly assumes 
that accurate predictions about experimental out- 
comes will be possible. Moreover, verdicts such as 
‘minimal risk and ‘minor increase over minimal 
risk’ do not distinguish whether the increase is 
because of the perceived extent of added risk or 
the uncertainty of that added risk. Two IRB mem- 
bers, in other words, can reach the same decision 
for very different reasons. This can make it harder 
for groups to reach a consensus. 

How should we think differently about risk? A good starting point 
would be to ensure that the risk to humans is always considered in con- 
text and not in an abstract sense. For instance, chairs of IRBs have been 
shown to grade the risk of lumbar puncture in 11-year-old children 
without sedation differently if they are told the children are healthy or 
ill and have undergone the procedure previously (S. Shah et al. JAMA 
291, 476-482; 2004). 

In my view, a better way to assess and discuss risk is by using a method 
of inquiry called post-normal science (PNS). Introduced in the 1990s, 
PNS was put forth as a heuristic risk-assessment framework to assist 
decision-making at the interface between environmental science and 
public policy (S.O. Funtowicz and J. R. Ravetz Futures 25, 739-755; 1993). 
Designed to address specific scenarios, it always 


| four years, officials in the United States have been working on 


focuses on contextual rather than abstract risk. > NATURE.COM 
Importantly, PNS makes no attempt to merge _ Discuss this article 
magnitude and probability of risk, which are _ onlineat: 


referred to as ‘decision stakes’ and ‘system __go.nature.com/vuwdsu 
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Rethink our approach 
to assessing risk 


The ‘post-normal science’ framework would make regulatory decisions 
about research in humans more effective, says Frederick Grinnell. 


uncertainties’ — to reflect the emphasis on value, not quantitative, 
judgements, which are required of regulatory groups. As an example of 
PNS applied to human research, the decision stakes (added risk) for a 
highly toxic intervention that is first undergoing testing would be higher 
for healthy people than for patients who have the condition under inves- 
tigation. In this example, the system uncertainties (uncertainty about 
the occurrence of added risk) are similar for both groups. 

However, the system uncertainties would be higher, say, ifa procedure 
such as cardiac catheterization in healthy people was to be performed by 
a physician with little experience, compared with one witha highly suc- 
cessful track record. (The decision stakes, added risk, would be similar.) 

Under the PNS approach, research administrators and IRB members 
would assess both criteria to allocate proposed research to one of three 
separate risk domains, which demand different 
levels of scrutiny and regulation. 

PNS designates the lowest of these domains 
as ‘applied science: Research currently typically 
categorized as exempt or expedited has relatively 
low decision stakes and low system uncertainty 
and would fall into this domain in which the 
principal investigator is given the responsibility 
to carry out the work with minimal oversight. 

As levels of decision stakes and system 
uncertainty rise to intermediate, proposals 
are placed in the second category, which in 
PNS is called professional consultancy. This 
is the domain of ongoing involvement of IRBs 
and the increasing importance of participant 
understanding of the research to give informed 
consent. PNS use of separate criteria would 
lead to clearer analysis of what makes a project 
more or less risky, and help IRBs to reach decisions. 

The final category, the domain of ‘post-normal science; is reserved for 
projects for which system uncertainties are so high that they include the 
adequacy of the current ethical principles used to assess risk. Values, in 
other words, become as important as facts. Managing risk in this zone 
needs extended consultation with a wider community, and one that 
assesses social values as well as scientific facts and expertise. A recent 
example of science that falls into this top tier is the question of whether 
or not to genetically engineer human embryos. 

Using PNS to assess risk in human research has received little atten- 
tion so far. But it offers a more coherent approach and permits a more 
nuanced analysis than the current regulatory framework. It would 
promote the goal of risk-based regulation of human research. m 


Frederick Grinnell is a professor in the Department of Cell Biology 
at the UT Southwestern Medical Center, University of Texas in Dallas, 
Texas. 

e-mail: frederick.grinnell@utsouthwestern.edu 
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RESEARCH HIGHLIGHTS 


Chimps are not 
averse to alcohol 


Chimpanzees at a site in 

West Africa drink alcohol- 
containing sap from raffia trees 
— the first systematic evidence 
that non-human apes ingest 
naturally fermented food in 
the wild. 

In Guinea, local people 
collect this sap by hanging 
containers under holes tapped 
near the crowns of raffia palms 
(Raphia hookeri). Kimberley 
Hockings at the Centre for 
Research in Anthropology 
in Lisbon and her colleagues 
found that the chimps dip 
crumpled leaves into the 
containers to sponge out the 
sweet sap, which contains 
about 3% ethanol. 

The results suggest that 
the last common ancestor of 
African apes and humans could 
tolerate the consumption of 
ethanol. 

R. Soc. Open Sci. 2,150150 (2015) 


Cuckoo finchisa 
master of mimicry 


Some birds use other species’ 
nests to lay eggs that resemble 
those of the other species, 
escaping the burdens of 
parenthood. Now researchers 
have found a type of finch in 
Africa that also mimics how 
another bird looks as an adult. 
The female cuckoo finch 
(Anomalospiza imberbis; 
pictured left) lays its eggs in 


Selections from the 
scientific literature 


HYDROLOGY 


Floods used as defensive weapons 


One-third of all major floods recorded in the 
Netherlands over the past 500 years were caused 
deliberately by humans to defend the territory 


during wartime. 


Adriaan de Kraker of Free University 
Amsterdam trawled historical maps, land 
ownership documents and written accounts 
of flooding events in the low-lying southwest 
Netherlands, where three rivers flow into the 
North Sea. The largest floods occurred either 
during storm surges when sea water breached the 


the nests of tawny-flanked 
prinias (Prinia subflava) but 
also strongly resembles the 
female of acommon African 
species called the southern red 
bishop (Euplectes orix; pictured 
right). The resemblance, 
studied by William Feeney at 
the University of Cambridge, 
UK, and his colleagues, may 
have evolved initially to help 
cuckoo finches to sneak closer 
to the prinias nests. But 
prinias have evolved what 
seems to be a countermeasure: 
they attack the finch and the 
southern red bishop with equal 
aggression. 

Proc. R. Soc. B 282, 20150795 
(2015) 


258 | NATURE | VOL 522 | 18 JUNE 2015 


flood defences, or during wartime when people 
destroyed dikes to deter enemies such as the 
Spanish in the sixteenth century. The tactic was 


also used by both sides during the Second World 


Graphene shines 
bright ina vacuum 


Physicists have designed a 
2D device that can emit bright, 
visible light, paving the way 
for ultra-thin light-emitting 
devices. 

One-atom-thick layers 
of carbon called graphene 
can withstand the high 
temperatures at which they 
emit light. But when graphene 
is stuck to a surface and 
electricity is applied, only a 
small amount of light energy 
is released because too much 
energy dissipates as heat. 
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War (pictured are residents of Walcheren being 
rescued from floods in 1945, after a sea wall was 
breached). Strategic flooding was not always 
successful in deterring the enemy, de Kraker 
says, and caused as much damage to land and 
property as natural floods did. 

Hydrol. Earth Syst. Sci. 19, 2673-2684 (2015) 


Young Duck Kim at Seoul 
National University and his 
colleagues got around the 
problem by putting a current 
through a piece of graphene 
suspended between two 
electrodes in a vacuum. By 
isolating the graphene, they 
were able to heat it to more 
than 2,500°C and increase the 
level of radiation by 1,000 times 
compared to graphene on a 
substrate. 

With further improvements, 
the device could serve as a 
nanoscale light emitter in ultra- 
thin displays, say the authors. 
Nature Nanotech. http://dx.doi. 
org/10.1038/nnano.2015.118 
(2015) 
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ASTRONOMY 


Stars seen forming 
in a far-off galaxy 


Astronomers have seen their 
best glimpse yet of stars 
forming in the early Universe. 
The ALMA radio telescope 
in Chile explored the SDP.81 
galaxy, which is 3.6 billion 
parsecs (11.7 billion light years) 
away from Earth. Its light was 
magnified and distorted by the 
gravitational pull of another 
galaxy between it and Earth, 
but a model developed by 
Yoichi Tamura of the University 
of Tokyo and his team 
corrected the distortions. Their 
images reveal many cold clouds 
of dust and gas that are driving 
a rapid rate of star formation. 
Several research teams have 
analysed the ALMA data to 
characterize other aspects of 
this galaxy. 
Publ. Astron. Soc. Japan 
http://doi.org/5bv (2015) 


Evaporation 
drives engine 


An engine fuelled only by 
water evaporation can power a 
miniature car and lights. 
Ozgur Sahin at Columbia 
University in New York and 
his colleagues applied bacterial 
spores to thin plastic strips. 
The spores absorb and release 
water with changes in relative 
humidity, so the strips curl 
and straighten. The team 
stacked the strips and formed 
them into a water-containing 
engine so that the strips were 
exposed to recurring periods of 
high and low humidity, acting 
like oscillators to power the 
engine. When attached to a 
generator, the engine powered 
light-emitting diodes. A rotary 
version attached to two pairs of 
wheels (pictured, left) pushed 


a 100-gram car forwards 
(pictured, right). 

The engine could be used in 
devices in areas that have scarce 
electricity, the authors say. 
Nature Commun. 6, 7346 (2015) 


Angry spiders spin 
long-lived colonies 


Animal personality can 
influence the lifespan of whole 
ecological communities. 
Female Anelosimus studiosus 
spiders create large webs shared 
by multiple females of this 
species and by many other 
types of spider. Jonathan Pruitt 
and Andreas Modlmeier of 
the University of Pittsburgh in 
Pennsylvania started colonies 
with either two docile spiders 
or two aggressive ones. Docile 
females allowed many kinds 
of spider to colonize and 
scavenged the other spiders’ 
prey. Aggressive ones caught 
their own prey and fought 
off competing and predatory 
spiders. All of the communities 
collapsed in the end, but those 
founded by aggressive females 
lasted up to three years longer. 
Communities founded 
by docile spiders reached a 
similar destabilizing species 
composition to aggressive 
spiders’ colonies, but did so ata 
faster rate, making the docile- 
founded colonies shorter lived. 
J. Anim. Ecol. http://doi.org/5bt 
(2015) 


GENOME EDITING 


CRISPR system 
controlled by light 


A popular genome-editing 
technique has been modified 
so researchers can use light to 
control when editing occurs 
and in which cells. 

To better control how the 
CRISPR-Cas9 system cuts 
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Popular ince 


SOCIAL SELECTION 


on social media 


Unreliable dietary data under fire 


Much of the data on personal eating habits — which form the 
basis of many national dietary guidelines — are unreliable and 
misleading, according to a report in Mayo Clinic Proceedings 
that drew attention on social media. The authors argue that the 
widely used survey method of asking people to recall their food 
and drink intake is so inaccurate that any conclusions based 

on such data should be called into question. “Must read paper 
in nutrition! At what point does inaccurate data become worse 
than no data?” tweeted Dylan MacKay, a human-nutrition 
researcher at the University of Manitoba in Winnipeg, Canada. 
But Brenda Davy, a clinical nutritionist at Virginia Polytechnic 
Institute and State University in Blacksburg, writes in an 
accompanying editorial that whereas self-reports are imperfect, 
they have helped to uncover vital nutritional links, such as that 
between fat intake and heart disease. 

Mayo Clin. Proc. http://doi.org/5dd; http://doi.org/5df (2015) 


Based on data from altmetric.com. 
Altmetric tracks and compiles online 
attention around research outputs 
such as papers, books and data sets. 


and alters DNA sequences, 
Moritoshi Sato and his team 
at the University of Tokyo in 
Japan split the Cas? DNA- 
cutting enzyme in two and 
fused each part toa protein. 
These proteins join together 
when hit with blue light, 
reactivating Cas9. They found 
that the modified enzyme 
could still edit the genome 
accurately in human cells in 
the lab. The frequency of edits 
increased with longer light 
exposures and no editing was 
seen in cells kept in the dark. 
The team also shone patterns 
of light onto the cells and found 
that genome editing occurred 
along the same patterns. 
The technique could help 
to improve the precision of 
CRISPR, the authors say. 
Nature Biotech. http://dx.doi. 
org/10.1038/nbt.3245 (2015) 


REGENERATIVE BIOLOGY 


Mammary ducts 
made in a dish 


Cells from human mammary 
glands develop in the lab 

into structures that look and 
behave like milk-producing 
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ducts in the breast. 

A team led by Christina 
Scheel at the Helmholtz Center 
for Health and Environmental 
Research in Munich, 
Germany, obtained mammary 
gland tissue from healthy 
women undergoing surgery 
and isolated cells that line the 
ducts of the glands. The team 
suspended individual cells 
in a3D flexible collagen gel, 
added growth factors, and 
found that the cells developed 
into structures that resembled 
the key functional units of the 
mammary gland. Some of the 
structures even contracted 
in the same way as ducts do 
during lactation. 

Cells developed into duct- 
like structures only when 
they were grown in collagen 
gels that were not physically 
attached to the culture dish, 
suggesting that a flexible 
matrix is needed for normal 
growth of this tissue. 
Development http://dx.doi. 
org/10.1242/dev.123554 (2015) 
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SEVEN DAYS sescnsi 


EVENTS 


Comet probe wakes 


Philae, the space probe 

that landed on a comet in 
November 2014, is awake. 
The European Space Agency 
(ESA) announced that it had 
received signals from the 
lander on 13 June. Philae’s 
battery ran out just three 
days after it landed on 67P/ 
Churyumov-Gerasimenko. 
Stuck in the shade, it was 
unable to charge its solar- 
powered battery and entered 
hibernation. Scientists had 
hoped that Philae would 
regain power as the comet 
neared the Sun. See page 263 
for more. 


Oil-sands petition 
A group of more than 

100 North American 
scientists released a petition 
calling for a moratorium on 
the development of Canadian 
oil sands. Issued on 10 June, 
the statement claims that 
crude oil from oil sands is 
responsible for more carbon 
emissions than are other 
transport fuels. Its use also 
pollutes the air and water. 
Continued development of 
the oil sands in Alberta is 
therefore inconsistent with 
efforts to reduce greenhouse- 
gas emissions unless it is 
coupled with a larger plan to 
address global warming, the 
petitioners say. 


Curve honoured 


The iconic ‘Keeling curve; 

a 57-year global record 

of carbon dioxide in the 
atmosphere, was made a US 
National Historic Chemical 
Landmark on 12 June. The 
late Charles David Keeling, 
a geochemist at the Scripps 
Institution of Oceanography 
in La Jolla, California, began 
the CO, measurements 

in 1958 at Mauna Loain 
Hawaii, and the curve has 
become a record of human 


Zoo escape adds to Georgia's flood woes 


Georgia declared 15 June a national day of 
mourning after severe flooding killed at least 
12 people and led to the escape of zoo animals 
in Tbilisi. A further 24 people are missing after 


effects on the atmosphere. 
The designation is from 
the American Chemical 
Society, which has held 
commemoration events at 
the Scripps campus and the 
Mauna Loa site. 


FACILITIES 


Array amplification 
The Telescope Array, a Japan- 
led cosmic-ray observatory in 
Utah, will nearly quadruple 

in size to about 2,500 square 
kilometres, the University 

of Utah in Salt Lake City 
announced on 15 June. 

Japan pledged ¥568 million 
(US$4.6 million) to add 
scintillation detectors, and 

US collaborators will seek 
$1.8 million to build extra 
fluorescence detectors. The 
upgrade could take three years 
to complete. Researchers hope 
that the expansion will help 
them to determine the nature 
of an apparent ‘hotspot’ of 
ultra-high-energy cosmic rays 
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coming from the part of the 
sky near Ursa Major. 


Neutrino switch 


Neutrinos — subatomic 
particles that come in 

three ‘flavours’ — have 

been confirmed to switch 
spontaneously from muon 

to tau flavour. Physicists 

in the OPERA experiment 
detected five tau neutrinos in 
a muon neutrino beam that 
travelled 730 kilometres from 
Switzerland to the Gran Sasso 
underground laboratories in 
Italy. OPERA spokesperson 
Giovanni De Lellis 
announced on 16 June that 
the phenomenon has been 
achieved with the necessary 
statistical precision for it to be 
considered a discovery. 


Aviation emissions 


The US Environmental 
Protection Agency issued a 
draft ruling on 10 June that 
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heavy rains. People in Tbilisi were warned to stay 
in their homes after various animals — including 
a bear and hippopotamus — escaped from the 
z00 after floods destroyed enclosures. 


greenhouse-gas emissions 
from aviation endanger 
human welfare. A formal 
finding of ‘endangerment’ 
would enable the agency to 
set climate regulations for 
aircraft under the Clean 
Air Act, building on similar 
regulations that have 
already been established 

for cars. The EPA said 

that it is participating in 

an international effort to 
develop carbon dioxide 
standards led by the 
International Civil Aviation 
Organization, which is 
expected to produce a set of 
standards in early 2016. 


Emissions peak 
Global energy-related 
emissions could peak as 
early as 2020, according to 

a report published by the 
International Energy Agency 
on 15 June. Achieving the 
goal would require countries 
to increase energy efficiency 
and use of renewable 
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w energies, reduce reliance 

E on coal, phase out fossil- 

% fuel subsidies and reduce 

4 methane emissions from 

é oil and gas production. The 

2 report also laid out steps 
to achieve a successful 
international climate 
agreement in Paris this 
December: negotiators must 
focus on an early peak in 
energy emissions, set up a 
five-year review of national 
targets, establish a long-term 
emissions goal and create a 
process to track progress. 


NIH role changes 


Sally Rockey, long-time 
head of the extramural 
research programme at the 
US National Institutes of 
Health (NIH), will leave 

the agency in September, 
she announced on 11 June. 
Rockey, well known for her 
Rock Talk blog about NIH 
grants, will now head a non- 
profit corporation called the 
Foundation for Food and 
Agriculture Research, which 
seeks private donations for 
research (see Nature http:// 
doi.org/5dm; 2015). On the 
same day, the NIH named 
neurologist Walter Koroshetz 
as director of the US National 
Institute of Neurological 
Disorders and Stroke. 
Koroshetz has served as the 
institute’s interim director 
since October 2014. 


TREND WATCH 


SOURCE: J. KAMP ET AL. CONSERV. BIOL. HTTP://DOI.LORG/5DH; 2015 


The population of what was once 
one of Eurasia’s most abundant 
songbirds has crashed. Numbers 
of the yellow-breasted bunting 
(Emberiza aureola) fell by 
84-95% between 1980 and 2013 
and its range has shrunk by 
5,000 kilometres (J. Kamp et al. 
Conserv. Biol. http://doi.org/5dh; 
2015). The bird is now thought 
to be extinct in the northwestern 
parts of its habitat. The authors 
of the study say that widespread 
trapping of the birds in China 
may be partly to blame, but that 


other drivers cannot be ruled out. 


Nature knighthood 
The editor-in-chief of 

Nature, Philip Campbell, was 
among those recognized for 
contributions to science in 

the United Kingdom's 2015 
Queen's birthday honours. 
Campbell (pictured) received 
a knighthood, one of the 
highest distinctions in UK 
honours system, alongside 
metallurgist Harshad 
Bhadeshia of the University 
of Cambridge, UK, Peter 
Downes, vice-chancellor of 
the University of Dundee, UK, 
and other researchers. Among 
those made dames were 
Frances Ashcroft, a diabetes 
researcher at the University of 
Oxford, UK, and Anne Glover, 
the former chief scientific 
adviser to the European 
Commission. See go.nature. 
com/gvzy22 for more. 


Frozen-ovary first 
A healthy baby boy has been 
born to a woman transplanted 
with her own preserved 


BIRD EXTINCTIONS 


The range of the yellow-breasted 
bunting has shrunk so much that 
the bird now seems to be extinct 
in many regions where it was 
once common. 


predicted 
extinction times 


Contours separate 


ovarian tissue, removed in 
childhood, researchers said on 
9 June. The woman had had 
her right ovary removed as a 
consequence of treatment for 
sickle-cell anaemia. Tissues 
from it were frozen, and were 
transplanted back a decade 
later. Two years after that, the 
woman conceived naturally, 
giving birth in November 2014 
(I. Demeestere et al. Hum. 
Reprod. http://doi.org/5dj; 
2015). Babies have been born 
from frozen ovarian tissue 
before — but previous cases 
have always involved tissue 
taken from adults. 


US chimp ruling 
Biomedical research using 
chimpanzees in the United 
States may become nearly 
impossible, following a 12 June 
ruling from the US Fish and 
Wildlife Service (FWS) that 
designated the animals an 
endangered species. Wild 
chimpanzees have been on 
the endangered list since 
1990, but until now their 
captive counterparts have 
been exempted. The FWS 
rule makes it illegal to harass 
or harm captive chimpanzees 
unless the research also 
benefits the animals in the 
wild. US chimpanzee research 
was sharply curtailed in 
2013, when the US National 
Institutes of Health retired 
most of its chimps. See 
go.nature.com/fvkouw for 


more. 
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SEVEN DAYS | THIS WEEK | 


22-26 JUNE 

Some 600 astrophysicists 
gather in Tenerife, 

Spain, for the annual 
meeting of the European 
Astronomical Society. 
go.nature.com/cnkj7g 


22-26 JUNE 
Pathogen detection, 
biosurveillance 

and point-of-care 
technologies will be 
discussed by leaders 
from government, 
academia and industry 
at the Biodefense World 
Summit in Bethesda, 
Maryland. 
go.nature.com/ymuyui 


24-27 JUNE 

The International 
Society for Stem Cell 
Research holds its annual 
meeting in Stockholm. 
More than 4,000 experts 
from 50 countries are 
expected to attend. 
go.nature.com/achzu3 


| __BUSINESS 
Cholesterol drugs 


An emerging class of 
cholesterol drug has won 
overwhelming support from 
advisers to the US Food 

and Drug Administration 
(FDA). On 9 and 10 June, 
advisers voted in favour of 
recommending FDA approval 
for two drugs that target a 
protein called PCSK9, which 
is involved in cholesterol 
processing by the liver. Drugs 
that inhibit PCSK9 boost 
cholesterol uptake (see Nature 
496, 152-155; 2013). The 

two drugs — alirocumab 
(Praluent) from Paris-based 
Sanofi and evolocumab 
(Repatha) from Amgen of 
Thousand Oaks, California 
— are under consideration for 
treating a genetic condition 
that causes high cholesterol. 


> NATURE.COM 
For daily news updates see: 
www.nature.com/news 
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Philae comet lander wakes up — with enough power to do experiments. 


BY ELIZABETH GIBNEY 
CC ‘e all said the chance that the 
lander would wake up was 
50:50, but actually many 
of us thought it was much less,” says Uwe 
Meierhenrich, an analytical chemist at the Uni- 
versity of Nice Sophia Antipolis in France. He 
is one of a group of scientists from the Rosetta 
comet mission who met last week in Rome 
to hash out a plan should their lander, Philae, 
awaken from its slumber on the surface of a 
comet. Not a peep had been heard from Philae 
since its battery ran out seven months ago, 
three days after a historic landing on comet 
67P/Churyumov-Gerasimenko. 

The doubters were in for a surprise. Days 


later, at 22:28 Central European Standard Time 
on 13 June, the European Space Agency (ESA) 
received a signal from Philae. 

Scientists had hoped that as the comet 
neared the Sun, the craft’s solar panels would 
provide it with enough power to wake, but they 
also feared that the cold might prevent a reboot. 
The incoming data show that not only is Philae 
awake, but also that its prognosis is better than 
anyone had hoped. “Both temperature- and 
power-wise we are in a really good state,” says 
lander manager Stephan Ulamec of the Ger- 
man Aerospace Centre (DLR) near Cologne. 

The Rosetta scientists are itching to begin 
experiments. But first the communication 
link — which depends on a direct line-of-sight 
between the lander and the Rosetta mothership 
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currently orbiting the comet — has to improve. 
As Nature went to press, the communication 
bursts had been just a few minutes long, com- 
pared with three-hour stints in November, and 
were often interrupted, suggesting that Philae 
is not in the orientation that scientists had pre- 
dicted. It might be that the lander has moveda 
little, says Meierhenrich. But Ulamec reckons 
that the predictions — used to align Rosetta’s 
antenna — were “slightly wrong” to begin with. 
The team is piecing together how to shift 
Rosetta’s orbit to better align the two craft, 
something ESA plans to do on 17 June. “By the 
end of the week we really should be getting bet- 
ter, longer communication slots,’ says Ulamec. 
Philae’s 24 watts of available power is a “good 
amount’, says Koen Geurts, a member of 
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> the landing team at the DLR. “It’s more than 
enough to communicate and to do science 
activities.” And the lander’s reported operating 
temperature of -35°C should be warm enough 
for the team to attempt turning on the battery. 
“Then we are not relying on the short window 
when the lander is illuminated,” he adds. “We 
can do more things.” 


HOMING IN 

Assuming that the instruments have survived 
the cold, the first experiments will measure 
the comet’s magnetic and electric fields and 
temperature, says Ulamec. Next, Philae will 
start up its cameras, which were last deployed 
when Philae landed. “The surface might have 
changed, and we might see different features 
due to changing illumination,’ says Ulamec. 


The CONSERT radar instrument, which 
is designed to probe the comet's interior, will 
be used to pinpoint Philae’s exact location: 
although scientists knew that Philae had 
landed in November, they do not know exactly 
where it ended up, because it bounced twice. 

Eventually, Philae should deploy its drill to 
get samples from the comet’s crust, says Mei- 
erhenrich, who is a co-investigator on Philae’s 
chemical analyser, COSAC. 

With enough sunlight to recharge its bat- 
tery, Philae has potentially more power and 
more operating time than before hiberna- 
tion, meaning that it should now be able to do 
more-complex experiments. For example, it 
could use an infrared microscope to examine 
samples before they are chemically analysed, 
to verify that its drill did scoop something up. 


“We couldn't do this in the first science phase; 
we had to run the most simple system we 
could,’ says Meierhenrich. 

The comet is now a very different environ- 
ment from when Philae landed, with a much 
stronger outpouring of gas and dust. For Philae, 
the change of seasons is double-edged: there 
will be more particles in the atmosphere for the 
lander to ‘sniff; says Ulamec, but the particles 
pose a danger to Rosetta’s navigation systems, 
and will continue to intensify until August, 
when the comet will be closest to the Sun. To 
avoid damage, Rosetta might need to retreat to 
such a distance that it would no longer be able 
to communicate with Philae on the surface. 
But once the comet rounds the Sun in Septem- 
ber, Rosetta could come back in closer and let 
Philae phone home again. = 


Recep Tayyip Erdogan meets students days after the party he co-founded lost its parliamentary majority. 


Election results 
delight scientists 


Turkish researchers hope that a more pluralistic parliament 
will put an end to interference and slipping standards. 


BY ALISON ABBOTT 


cientists in Turkey are euphoric after a 
7 June election that stripped the mildly 


Islamic and increasingly repressive Jus- 
tice and Development Party (AKP) of its 
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absolute parliamentary majority. 

They hope that Turkey’s next parliament will 
reverse the creeping restrictions on academic 
freedom and the seeping away of scientific 
standards that have been a feature of the AKP’s 
12 years of political domination. These include 
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stripping public science organizations of their 
autonomy; allowing the teaching of creation- 
ism and astrology; and, most recently, a ban on 
social scientists interviewing political refugees. 

Many of the policies that scientists object to 
were pushed by Recep Tayyip Erdogan, who 
was prime minister between 2003 and 2014 and 
is now president. Although the AKP remains 
the largest party in parliament, it now needs a 
coalition partner to form a government. 

“With more pluralism in the parliament 
there are more prospects that some of the new 
rules imposed during the last years could be 
unpicked,” says Ali Alpar, an astrophysicist 
from Sabanc1 University in Istanbul. 

Turkish academics have never been entirely 
free, but the AKP has escalated government 
interference. For example, as in many European 
countries, university faculty members in Turkey 
vote for their rectors. However, the centralized 
higher-education council YOK then compiles 
a final shortlist, from which Turkey’s president 
selects the winner. The AKP had promised to 
abolish the unpopular council but ended up 
using YOK to promote its own ideology: most 
Turkish rectors are now AKP sympathizers. 

In the latest round of appointments in April, 
Erdogan caused consternation at Istanbul 
University by selecting historian Mahmut Ak 
as rector, instead of the left-leaning psychia- 
trist Rasit Tikel, who won most of the faculty’s 
votes and the favour of almost 14,500 people 
who petitioned Erdogan to pick Tikel. 

The People’s Democratic Party (HDP), a 
coalition of anti-nationalist, environmentalist 
and other leftist movements, won 13% of the 
vote in the election. It will use its influence in 
parliament to prioritize the freeing of academia 
from state control and to push for the abolition 
of YOK, says Gengay Giirsoy, a neurologist who 
won a seat in the new parliament for the HDP. 

“T can’t tell you how we will celebrate if 
YOK is finally abolished,” says Esra Mungan, a 
psychologist at Bogazici University in Istanbul 
and a member of the HDP advisory council. 


UMIT BEKTAS/REUTERS/CORBIS 
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“We are going to dance in the streets.” 

The AKP’s grip on science extends beyond 
YOK. In 2005, Erdogan began to place political 
loyalists in top posts at Turkey’s research-fund- 
ing agency TUBITAK, which had previously 
enjoyed a degree of autonomy. Many of them 
were from the Giilen movement — a trans- 
national religious and social organization. 
Scientists say that research funds were no 
longer distributed according to merit and that 
the agency has seemed to be anti-evolution. In 
2009, TUBITAK removed a portrait of Charles 
Darwin from a cover of a government-backed 
science magazine and sacked the editor (see 
Nature 458, 259; 2009). Under the AKP, which 
turned Turkey from a constitutionally secular 
nation to one where religion is state-sponsored, 
creationism is often taught in schools and 
debated in universities. (The AKP fell out with 
the Gillenists in 2013 and purged them from 
TUBITAK, leaving it in chaos.) 

In 2011, the science ministry assumed con- 
trol of the Turkish Academy of Sciences, TUBA. 
It decreed that TUBITAK and YOK would 
appoint two-thirds of TUBA’ members, who 
would then elect the remaining one-third. Most 
of TUBA original members resigned in protest 
and launched another national academy, Bilim 
Akademisi, which has regularly challenged sci- 
ence standards. 

Earlier this month, the academy, of which 
Alpar is president, identified two universities 
that had approved theses and certificates in 
astrology, and called on YOK not to allow such 
unscientific practices. It also contested a govern- 
ment order forbidding university researchers 
from interviewing refugees without govern- 
ment supervision. The academy argued that the 
government’ claim that the interviews would 
infringe data-protection laws was invalid. 

Until a new government forms, the AKP 
remains in charge. It irked scientists further 
on 11 June by appointing Ahmet Arif Ergin as 
head of TUBITAK. “It is extremely undemo- 
cratic and inappropriate for a caretaker gov- 
ernment to make such an important political 
appointment,’ says Mungan. 

But the election results have brought hope. 
“You can't imagine what a relief it has been in 
scientific circles — the one-man rule had looked 
unstoppable, but now it is over,’ says Sevket 
Ruacan, a pathologist at Koc University in Istan- 
bul and a former TUBITAK board member. 
Change will take time, he says, “but conserva- 
tive religious influences may now be reduced in 
education and it may become possible to reduce 
the political influences in scientific matters”. 

At least Turkish scientists have not wanted 
for funding. Research spending has more than 
trebled since Erdogan came to power — a result 
of his now-stalled endeavour for Turkey to be 
admitted to the European Union. Some major 
infrastructures have been developed, includ- 
ing the ambitious €60-million (US$68-million) 
Izmir Biomedicine & Genome Center, due to 
open in September. = 
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Researchers hope to find drugs that extend a person’s healthy years. 


CLINICAL RESEARCH 


Ageing pushed as 
treatable condition 


Regulators asked to consider innovative trial design. 


BY ERIKA CHECK HAYDEN 


octors and scientists want drug reg- 
D ulators and research funding agen- 

cies to consider medicines that delay 
ageing-related disease as legitimate drugs. 
Such treatments have a physiological basis, 
researchers say, and could extend a person's 
healthy years by slowing down the processes 
that underlie common diseases of age- 
ing — making them worthy of government 
approval. On 24 June, researchers will meet 
with regulators from the US Food and Drug 
Administration (FDA) to make the case for 
a clinical trial designed to show the validity 
of the approach. 

Current treatments for diseases related 
to ageing “just exchange one disease for 
another’, says physician Nir Barzilai of the 
Albert Einstein College of Medicine in New 
York. That is because people treated for one 
age-related disease often go on to die from 
another relatively soon thereafter. “What we 
want to show is that if we delay ageing, that’s 
the best way to delay disease.” 

Barzilai and other researchers plan to test 
that notion in a clinical trial called Target- 
ing Aging with Metformin, or TAME. They 
will give the drug metformin to thousands 
of people who already have one or two of 
three conditions — cancer, heart disease 
or cognitive impairment — or are at risk of 
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them. People with type 2 diabetes cannot be 
enrolled because metformin is already used 
to treat that disease. The participants will 
then be monitored to see whether the medi- 
cation forestalls the illnesses they do not 
already have, as well as diabetes and death. 
On 24 June, researchers will try to convince 
FDA officials that if the trial succeeds, they 
will have proved that a drug can delay ageing. 
That would set a precedent that ageing is a 
disorder that can be treated with medicines, 
and perhaps spur progress and funding for 


ageing research. 
“What we’re During a meeting 
trying to dois on 27 May at the US 
increasehealth National Institute 


on Aging (NIA) in 
Bethesda, Mary- 
land, Robert Temple, 
deputy director for clinical science at the FDAs 
Center for Drug Evaluation and Research, 
indicated that the agency is open to the idea. 
Barzilai and his colleagues eschew claims 
ofa quest for immortality, because they think 
that such assertions have led to a perception 
that the field is frivolous and irresponsible. 
“The perception is that we are all looking for a 
fountain of youth,” says Stephanie Lederman, 
executive director of the American Federation 
for Aging Research in New York. “We want to 
avoid that; what we're trying to do is increase 
health span, not look for eternal life” > 


span, not look 
for eternallife.” 
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> Ageing research has hit bumps in the past 
decade, as companies marketing drugs touted 
to prolong life have gone bust (see Nature 464, 
480-481; 2010). But organizers of the TAME 
trial think that the field is now in a better posi- 
tion because animal studies have shown that 
some drugs and lifestyle practices can extend 
life by targeting physiological pathways’. 

For instance, the NIA-sponsored Interven- 
tions Testing Program, in which investigators 
at three sites are systematically trialling can- 
didate age-delay treatments, has shown that 
a handful of interventions convincingly and 
reproducibly prolong the lives of various 
strains of mice. Those include cutting down 
on calorie intake and taking a drug called 
rapamycin that is used to prevent rejection 
of transplanted organs. 

And researchers from the Novartis Institutes 
for Biomedical Research in Cambridge, Mas- 
sachusetts, reported in December that elderly 
people develop a stronger immune response 
to an influenza vaccination if they also take 
a rapamycin-like drug”. Rapamycin, which 
acts on a biological pathway involved in cell 
growth, is now seen as one of the most prom- 
ising drugs for delaying ageing, but given over 
long periods of time it also suppresses the 
immune system. 


SAFETY FIRST 

The TAME test is for metformin, which sup- 
presses glucose production by the liver and 
increases sensitivity to insulin. The drug 
has been used for more than 60 years and is 
safe and prolongs healthy life and lifespan 
in worms’ and in some mouse strains’. Data 
also suggest that it could delay heart disease, 
cancer, cognitive decline and death in people 
with diabetes*. Plans call for the trial to enrol 
3,000 people aged 70-80 years at roughly 
15 centres around the United States. The trial 
will take 5-7 years and cost US$50 million, 
Barzilai estimates, although it does not yet 
have funding. 

Matt Kaeberlein at the University of 
Washington, Seattle, who is running a trial 
of rapamycin in elderly dogs, says that the 
concept behind Barzilai’s trial is sound. Even 
though other drugs might be more effective 
at delaying ageing in animal studies, he says, 
the many years of experience with metformin 
in people, combined with data suggesting that 
it impacts the ageing process in people, make 
it a good candidate for a first clinical trial in 
the field. 

“It’s a smart way to engage the FDA ina 
discussion about recognizing ageing as an 
indication that is appropriate for clinical 
trials,” Kaeberlein says. = 
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Infrared technologies are subject to tight restrictions because of their military applications. 


TECHNOLOGY 


US ‘export rules’ 
threaten research 


Proposed updates to national-security regulations would 
restrict collaboration with foreign scientists and industry. 


BY ALEXANDRA WITZE 


he US government is considering 

| policy changes that could dramatically 

affect how researchers handle equip- 

ment and information that have national- 

security implications. Among other impacts, 

scientists would need to reconsider what they 

can discuss with graduate students from 

other countries, or when travelling abroad 
on work trips. 

One set of rules would affect technolo- 
gies such as infrared detectors, which are 
commonly used for environmental and 
other types of monitoring but can fall 
under military restrictions because of their 
night-vision capabilities. Another set of 
proposed rules would revise the govern- 
ment definition of “fundamental research” 
in ways that could affect any scientist who 
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collaborates with industrial partners. 

The proposals are part ofa long-running 
effort to overhaul Byzantine regulations 
that seek to prevent certain information 
and technologies from reaching countries 
perceived to be hostile by the United States. 
These ‘export control’ rules can create a 
major — even perilous — headache for US 
researchers. In 2012, a plasma engineer at 
the University of Tennessee in Knoxville 
began serving a four-year jail sentence for 
violating these rules by working with gradu- 
ate students from China and Iran while on a 
US Air Force contract. 

Public-comment periods on the proposed 
rules close on 6 July and 3 August, and science 
groups are trying to raise awareness of what is 
at stake. “Once the rules get put in place, it’s 
very hard to dial back if you find them overly 
aggressive, says Gregory Quarles, a physicist 
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and board member of the Optical Society, a 
professional organization in Washington DC 
that is encouraging its members to study the 
rules and weigh in with any concerns. 

Export control covers both physical equip- 
ment that leaves the United States — such as 
spacecraft components on their way to a launch 
site, or a laptop computer carrying sensitive 
information to a conference — and informa- 
tion conveyed to someone from another coun- 
try. There are two parallel lists that describe 
which items are controlled, one maintained 
by the US Department of State and the other 
by the Department of Commerce. Over the 
past five years, regulators have been working 
to relax the system, moving some items from 
the restrictive state-department list — com- 
monly known as ITAR, for International Traf- 
fic in Arms Regulations — to the less tightly 
controlled commerce-department list. But they 
have left some of the hardest topics for last. 

On 5 May, the departments released the first 
set of long-awaited proposals for regulating the 
category that includes sensors and night-vision 
equipment. Many researchers hoped these 
would loosen current regulations that they say 
have impeded research, but that does not seem 
to be the case. The rules specify performance 
parameters, such as wavelengths for infrared 
cameras, that would continue to restrict com- 
mercially available sensors that are popular 


with many researchers for environmental, 
medical and other applications. 

At the University of Rochester in New York, 
astronomer Judith Pipher develops infrared 
detectors to observe the cosmos, which are 
regulated under ITAR because military night- 
vision goggles also use infrared sensors. Astro- 
nomical sensors detect radiation that is one 
million times less intense than that detected by 

the military, but nei- 


“We’ve heard ther the current nor 
from at least a the proposed rules 
fewinstitutions take that into account. 
that this will “I dont think this par- 
totally stifle ticular reform is going 
their ability to get us there,” says 
to work with Pipher, who, before 
industry.” she acquired US 


citizenship, was once 
barred from seeing data gathered by infrared 
instruments that her team was testing. 

The second set of proposed rules could 
have wider-reaching consequences. It aims to 
harmonize state and commerce department 
definitions of terms such as “fundamental 
research” and “public domain”. 

In general, the rules state that information 
developed through fundamental research is 
considered to be in the public domain and 
not subject to ITAR controls. But one newly 
added phrase says that if the research is subject 
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to proprietary review before being published, 
such as by a company that sponsored the work, 
then it falls under ITAR. 

“We've heard from at least a few institu- 
tions that this will totally stifle their ability 
to work with industry,’ says Tobin Smith, 
vice-president for policy at the Association of 
American Universities in Washington DC. 

Any university researcher who has a con- 
tract or sponsored research agreement with 
a company could face a huge load of paper- 
work. “It will fundamentally change the way we 
do business,” says Elizabeth Peloso, associate 
vice-president and associate vice-provost for 
research services at the University of Pennsyl- 
vania in Philadelphia. 

The new rules would add some needed 
changes, she adds, such as clarifying that a US 
researcher can e-mail data to a collaborator 
elsewhere in the world, as long as those data 
are encrypted. 

After the comment periods close, the state 
and commerce departments will either issue 
another round of interim rules for more feed- 
back, or publish whatever final regulations 
they decide on. 

“You asked for reforms, so now you got 
it,’ says Kevin Wolf, who oversees the export 
control reform effort at the commerce depart- 
ment. “If you’re an export nerd, now is an 
exciting time.” m 
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Graphene booms in factories 
but lacks a killer app 


Although the wonder material is being made in record volume, commercial success is elusive. 


Graphene films can be used as transparent electrodes for touchscreens. 


BY MARK PEPLOW 


he city of Manchester, UK, is gearing 

| up for a graphene jamboree. Graphene 

Week 2015, which kicks off on 22 June, 

is sure to delight its more than 600 attendees 

with a conference and celebrations of the 

‘wonder material’ Graphene’s commercial 
future, however, is much less certain. 

The atom-thin flakes of carbon are being 
produced in record volume and have found 
their way into a handful of eye-catching giz- 
mos. But experts fret that graphene production 
far exceeds requirements, and that the material 
offers only marginal benefits over incumbent 
technologies in many of its target applications. 

“There's a heck of a lot of production capacity 
and not much demand, because we just haven't 
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seen any compelling technologies coming 
through,’ says Ross Kozarsky, a senior analyst 
at market-intelligence company Lux Research, 
who is based in San Francisco, California. 
This mirrors the trajectory of carbon nano- 
tubes, which were once touted as transforma- 
tive but have so far failed to make a significant 
commercial impact. “Graphene looks much 
closer to the next carbon nanotube than the 
next silicon,” says Kozarsky. In a 2014 report, 
Lux predicted that the global market for gra- 
phene would be worth US$349 million by 2025; 
by comparison, the University of Manchester 
estimates that graphene has already attracted 
$2.4 billion for research (see ‘Market growth). 
Some scientists are growing frustrated that 
companies are not lining up outside their 
offices begging to use the material, says Helena 
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Theander, innovation deputy at the European 
Union's €1-billion (US$1.1-billion) Graphene 
Flagship research programme, which was set 
up in 2013 to usher graphene from the lab to 
the marketplace within a decade. 
Researchers have been in thrall to gra- 
phene’s properties ever since the material was 
first isolated by the University of Manches- 
ter’s Andre Geim and Konstantin Novoselov 
(K. S. Novoselov et al. Science 306, 666-669; 
2004) — who won the 2010 Nobel Prize in 
Physics for their work. Graphene is transpar- 
ent and flexible, yet very strong; electrons 
zoom through it; and it conducts heat ten 
times better than copper. Enthusiasts expect 
these qualities to propel graphene into dozens 
of commercial applications, from flexible elec- 
tronics to strong, lightweight composites. 


GRAPHENE GLUT 

This has sparked a graphene-making frenzy, 
particularly in China (see “Manufacturing 
fever’), where the government has poured 
investment into some half a dozen ‘graphene 
industry parks, which host research institutes 
and business incubators side by side. Manufac- 
turers enjoy reduced infrastructure costs, and 
are not expected to repay government grants. 
The graphene mass-produced in China falls 
broadly into two categories: thin films that can 
be used as transparent electrodes for touch- 
screens, for example, replacing the brittle 
indium tin oxide (ITO) used today; and nano- 
platelets, stacks of graphene flakes that form a 
black powder targeted at applications such as 
battery electrodes, in which their high surface 
area may help to store more charge. 

Products are starting to appear. Last year, 
AWIT in Wuxi, China, brought 2,000 gra- 
phene-touchscreen phones to market, and in 
March, Galaxy Microsystems of Shenzhen, 
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China, released 30,000 phones that use 
graphene in their touchscreens, cases and 
batteries (see ‘Batteries and phones rule’). But 
Kozarsky says that some nanoplatelet produc- 
ers are selling their product at below cost to 
offload their surplus. And although graphene- 
film factories have quoted impressive-sounding 
figures on capacity, these are sometimes used 
as a “marketing tool’, says Guocai Dong, vice- 
dean of the JiangNan Graphene Research Insti- 
tute, which is in the Changzhou graphene park. 
“Most of the capacity is not used,” he says. 
One reason for the mismatch is that gra- 
phene films still do not beat ITO on price. The 
films are grown on hot copper foil from a feed- 
stock of methane gas mixed with hydrogen, in 
a process called chemical vapour deposition 
(CVD). Dong says that heating the foil, main- 
taining clean rooms and separating graphene 
from the foil tend to push the final cost above 
400 yuan (US$64) per square metre, about 
twice what it needs to be to compete with ITO. 
And although graphene’ flexibility could add 
value to wearable devices, the wonder material 
will have to beat off other pretenders to ITO’s 
throne, including silver-nanowire meshes, 
which have been in development for longer. 
A new approach to manufacturing is needed 
to depress costs further, says Xuesong Li, who 
helped to develop the copper-foil CVD method 
(X. Li et al. Science, 324, 1312-1314; 2009). Last 
year, Li founded LasLumin, a company in New 
York City that aims to slash graphene manufac- 
turing costs by using feedstocks that are easier 
to handle than the potentially explosive mix- 
ture of methane and hydrogen. Graphene films 
may find more-profitable applications in inno- 
vative chemical sensors, rather than in taking 
onan established material such as ITO, he adds. 


FROM LAB TO MARKET 

China also dominates the graphene patent 
landscape (see ‘Patent kings’). SIO Grafen, 
an innovation programme in Gothenburg, 
Sweden, that supports industrial graphene 
development, reported in March that Chinese 
entities hold 45% of all graphene patents 
granted in the seven categories that it surveyed. 
That is more than twice the share of China’s 
nearest rival, the United States. However, most 
of the Chinese patents are registered nationally 
rather than globally, and they tend to be cited 
less often than the US ones, suggesting that 
they may not have much impact on graphene's 
long-term commercial development. 

Enter the Graphene Flagship, which plans 
to focus on quality and innovation. With that 
in mind, Theander has run seven Graphene 
Connect workshops to introduce small and 
medium-sized companies to graphene research- 
ers, and will host another during Graphene 
Week. But she says it is too soon for concrete 
results: “Growing trust and new relationships 
takes time, so I imagine it will take a couple of 
years until we have true success stories.” 

A similar approach has proved successful 
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Graphene production capacity has exploded in Asia, driven mainly by China. 
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Patent kings 


China dominates the graphene patent landscape in key fields, but its 
patents are overwhelmingly domestic and may have little global impact. 
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Batteries and phones rule 
The most patents filed for graphene 
applications concern energy storage 
and composite materials. 
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at the University of Manchester’s National 
Graphene Institute (NGI), says its business 
director, James Baker. The NGI has almost 40 
industrial partners who have funded work at 
the university; the biggest, BGT Materials in 
Manchester, has a subsidiary called Graphene 
Lighting that unveiled a ‘graphene light bulb’ 
in March. The bulb is actually a conventional 
light-emitting diode (LED) coated with a 
transparent film of graphene that draws off 
heat more effectively than conventional metal 
components, improving the efficiency and lon- 
gevity of the LED. The company hopes that the 
bulb will go on sale in the next few months. 

Meanwhile, Baker notes, multinational 
sports-equipment company Head has unveiled 
arange of tennis rackets that are strengthened 
by a dash of graphene in their frames. 

Such niche applications are a start, but if 
graphene is to fulfil the commercial potential 
anticipated by projects such as the Graphene 
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Market growth 


Investment committed to graphene 
research so far dwarfs even the raw 
material’s projected sales in 2025. 
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Flagship, it will need to find a role in which it 
dramatically and reliably outperforms existing 
technologies at a reasonable cost. 

Baker believes that the answer lies in 
creating devices that rely on a combination 
of graphene’s properties — flexible electron- 
ics, for example — or that could have a trans- 
formative global impact, such as in membranes 
that desalinate water. But it is early days, he 
says: “There are so many potential applications 
that it’s difficult to see what will be the killer 
app: ” m SEE NEWS FEATURE P.274 AND BOOKS & ARTS P.284 


CORRECTION 

The News story ‘Start-ups fight for a place 
in Boston's biotech hub’ (Nature 522, 138- 
139; 2015) misstated prices for lab and 
office space in Kendall Square; the figures 
reflect annual rather than monthly rents. 
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Microbiologists are finding new ways to explore the vast 
universe of unknown microbes in the hunt for antibiotics. 


BY CORIE LOK 
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he first time Robert Heinzen tried 

to get Coxiella burnetii to grow 

by itself, he failed miserably. The 

bacterium, which causes an influ- 

enza-like illness called Q fever, 
normally divides only inside the cells it infects 
— forcing researchers to grow it in mammalian 
tissue and hampering their efforts to investi- 
gate the microbe. When Heinzen tried to find 
a different way to culture it during his time as 
a postdoc in the early 1990s, he emerged with 
only half book of scribbled notes. 

But the problem kept nagging at him until 
2003, when the C. burnetii genome was 
sequenced’ and he was starting a lab at the US 
National Institutes of Health’s Rocky Mountain 
Laboratories in Hamilton, Montana. Heinzen 
thought that the genome could offer impor- 
tant clues to the bacterium’s metabolism and 
growth. Even so, it took his postdoc Anders 
Omsland almost four years of systematically 
testing hundreds of combinations of culture 
conditions to come up with the perfect rec- 
ipe for cultivating the microbe outside cells’. 
“When he showed me the cultures, I thought, 
it’s got to be a contaminant,’ Heinzen recalls. 
But several more months of work confirmed 
their success. 

Coxiella burnetii is still in the minority. An 
estimated 85-99% of bacteria and archaea 
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cannot yet be grown in the lab, drastically lim- 
iting scientists’ knowledge of microbial life and 
holding back the search for new antibiotics, 
which tend to be derived from bacteria. That 
search is becoming more urgent as resistance 
to existing drugs surges: last month, the World 
Health Organization approved a global plan 
to combat antibiotic resistance, and a review 
panel appointed by the UK government called 
for a £1.3-billion (US$2-billion) investment 
from the global drug industry to revitalize anti- 
biotic research. To find new drugs, researchers 
say that they need alternative ways to investi- 
gate the array of uncultured organisms — the 
mysterious dark matter of the microbial world. 

Scientists are already taking steps towards 
this goal. Advances in cultivation methods 
and other technologies have helped them to 
grow previously unculturable microbes, and 
improved DNA sequencing and bioinformatics 
are allowing them to examine some microbes 
without needing to grow them at all. The work 
has uncovered a breathtaking amount of micro- 
bial diversity in samples ranging from soil to 
permafrost, marine sponges, hydrothermal 
vents and the crevices of the human body. Some 
of the discoveries are already pointing to pos- 
sible antibiotics — and scientists say that they 
are only just scratching the surface. 

“There is, for sure, high potential for 
more biodiversity to be discovered,’ says Ute 
Hentschel, a marine microbiologist at the Uni- 
versity of Wiirzburg in Germany. “If you look 
for more, you'll find more” 


Conventionally, biologists have studied 
microbes by growing pure cultures of a species 
in fairly standard sets of nutrients. The trouble 
is that bacteria do not live like that in nature: 
they inhabit a huge range of environments, 
usually alongside other organisms, and sci- 
entists have struggled to recreate those condi- 
tions. But as Heinzen and Omsland showed 
with their studies on C. burnetii, genetic 
sequences can throw open a door. 

Omsland used sequencing to compare the 
genes expressed when the bacteria were grow- 
ing successfully inside host cells with those 
expressed when they were struggling to grow 
alone. He found a suite of genes involved in 
protein synthesis that were less active in the 
struggling microbes, a hint that adding amino 
acids and peptides to the growth medium 
might help the bacterium to thrive. But even 
when Omsland managed to increase the bac- 
terium’s protein synthesis 13-fold, it still would 
not divide’. 

The final clue came from genes suggesting 
that C. burnetii could survive in low-oxygen 
environments. When the team placed the 
microbe in 5% oxygen or less, they finally saw 
it grow. “That was the critical finding,” says 
Heinzen. “It wasnt a nutrient, it was an envi- 
ronmental factor.” 

Since adopting the new ‘axenic’ or 
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“THAT WAS THE 
CRITICAL FINDING. 
IT WASN'T A 
NUTRIENT, IT WAS 
AN ENVIRONMENTAL 
FACTOR.” 


host-cell-free culture technique, the 
C. burnetii field has expanded. By selectively 
turning genes on and off, researchers have 
learned about how the bacterium interacts 
with host cells to infect them and divide. “The 
ability to grow Coxiella axenically has, with- 
out any exaggeration, completely revolution- 
ized this field of study,” says Hayley Newton, a 
microbiologist and Coxiella researcher at the 
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University of Melbourne in Australia. The 
bacterium is highly transmissible through 
air and is considered a possible biothreat. 
Heinzen’s lab is now working on making 
strains in which key virulence genes have 
been inactivated, in the hope that they might 
be useful in developing vaccines. 
Researchers are now designing culture sys- 
tems for other microbes that grow only inside 
cells. Omsland, now at Washington State Uni- 
versity in Pullman, has developed a cell-free 
culture system for Chlamydia trachomatis’, 
the pathogen behind one of the most common 
sexually transmitted diseases. He has not yet 
coaxed Chlamydia to divide in his medium, 
but “I was born optimistic,” he says — and his 
success with C. burnetii fuels his hope. 


One way to speed up the process of finding 
a culture recipe is to use microfluidic chips 
— devices with thousands of tiny wells con- 
nected by channels that make it possible to run 
many experiments in parallel. After using this 
method to cultivate a new microbe’, Rustem 
Ismagilov at the California Institute of Tech- 
nology in Pasadena and his collaborators even 
named the bacterium isolate microfluidicus 1. 
Ismagilov was already working on 
microfluidics when, in 2012, a group of micro- 
biologists issued a list of ‘most wanted’ taxa 
—acall to the research community to grow 
and sequence microbes that were relatively 
common in the human body, were distantly 
related to already-sequenced organisms and 
had eluded all attempts at cultivation’. 
Ismagilov and his team answered the call 
with a device that holds 3,200 nanolitre-sized 
wells and that can fit in the palm of a hand. 
They scraped samples from the gut lining of 
a healthy volunteer, and then diluted them 
so that no more than one cell would end up 
in each well. By filling so many wells, the 
researchers increased the chances that their 
target organism — a human-gut microbe in 
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the Oscillibacter genus — would find its way 
into at least a few of them. The team used about 
ten chips to test various conditions, and looked 
for growth of the microbe by checking its DNA 
for a key marker gene. 

They managed to find their bacterium, and 
then grow it to larger amounts in Petri dishes. 
It was one of the first members of the wanted 
list to be cultivated. Further genetic study 
revealed that isolate microfluidicus 1 was not 
actually part of the Oscillibacter genus; it had 
been classified incorrectly and was actually 
part of a new, related group that the team is 
now working to characterize. 

A key ingredient for growing this bacterium, 
the team found, was a dash of fluid that had 
been extracted from the volunteer’s intestine. 
Being able to stretch the use of such a precious 
sample across thousands of experiments is 
an important advantage of the microfluid- 
ics approach, says Ismagilov. Another is that 
each starting cell does not have to compete 
with other species. “Microfluidics allows us 
to identify culture conditions efficiently and 
then increase our chances that our target will 
grow,’ he says. 

Xiaoxia Nina Lin, a chemical engineer at 
the University of Michigan in Ann Arbor, is 
using microfluidics to hunt down members of 
the most-wanted list in human faecal samples. 
Microbes normally live in complex commu- 
nities and often rely on other species, so Lin 
is trying to dissect those relationships by put- 
ting two, three or four cells together in myriad 
combinations on a chip, and working out who 
is dependent on whom. “It’s a good engineering 
approach,’ says Vincent Young, an infectious- 
disease researcher at the University of Michigan 
who is helping Lin to obtain clinical samples. 
“You can quickly reduce the complexity.” 


When Slava Epstein and Kim Lewis started to 
collaborate 15 years ago, they realized that they 
might not need to coax recalcitrant microbes 
into growing in the lab. Ifa bacterium already 
grows happily in its natural environment, they 
reasoned, then why not just cultivate it there? 
So the two microbiologists, from Northeastern 
University in Boston, Massachusetts, started 
working on a simple device that they could stick 
in the ground. They called it the iChip. 

The approach paid off earlier this year, when 
Lewis, Epstein and scientists from their start- 
up company NovoBiotic Pharmaceuticals in 
Cambridge, Massachusetts, reported that they 
had used the iChip to isolate a new bacterial 
species from soil®. The thumb-sized device is 
less sophisticated than a microfluidics chip: it 
consists of 384 tiny wells that were filled with 
samples of soil that have been mixed with agar 
and diluted to ensure that only one cell ends 
up in each chamber. The chip is sealed with a 
membrane that traps the bacteria but allows 
molecules to diffuse back and forth, and was 
planted in a grassy field in Maine — the same 
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soil from which the sample had been taken. 
After a month, the researchers transferred 
colonies from the chip to Petri dishes in the lab, 
took extracts from them and screened those for 
antibiotic activity. They had grown 10,000 types 
of bacterium — many more than if they had 
just put the soil sample on an agar plate. They 
homed in on a new species that they called 
Eleftheria terrae’ and found that the bacterium 
produces an antibiotic, called teixobactin, 
that kills various human pathogens in the lab, 
including drug-resistant strains of Staphylococ- 
cus aureus. “To me that’s a phenomenal result, 
that they can find really neat new molecules 
from groups of organisms that pharma largely 
hasn't focused on,’ says Sean Brady, a chemical 
biologist at Rockefeller University in New York. 
But what generated headlines was the discov- 
ery that other bacteria did not develop resist- 
ance to teixobactin®, as they do to most other 
antibiotics. That is because teixobactin binds 
to molecules that have important roles in cell- 
wall synthesis; bacteria are not known to modify 
these molecules to evade the effects of antibi- 
otics. The clincher, Lewis says, is that although 
E. terrae is inherently resistant to the teixobac- 
tin, it does not seem to have resistance genes 
that could be transferred easily to other bacteria. 
This does not mean that resistance will never 
emerge, but that it could take 20 or 30 years. 
The NovoBiotic team went on to grow larger 
quantities of the bacterium. It is now generat- 
ing grams of the drug using a fermenter, doing 
extensive preclinical testing of this and other 
drug candidates, and seeking more leads from 
uncultivated microbes in soil and marine sam- 
ples. Epstein is using the iChip to culture new 
microbes from soil and water in Greenland, 


© 2015 Macmillan Publishers Limited. All rights reserved 


and says that he has received more than 
200 requests this year for the device and advice 
on how to use it. 


DON’T CULTURE, SEQUENCE 

Despite these successes, culturing microbes 
is still a complicated, hit-and-miss affair, so 
many researchers are bypassing it altogether 
and instead learning what they can from DNA. 
Advances in sequencing methods mean that 
scientists can now analyse genomes from indi- 
vidual uncultured microbial cells — rather than, 
as before, typically sequencing a community of 
many different types of microbe en masse and 
then trying to piece the sequences back together. 

Tanja Woyke at the US Department of Ener- 
gy’s Joint Genome Institute in Walnut Creek, 
California, first got interested in single-cell 
sequencing not long after the key discovery ten 
years ago that an enzyme from a bacteria-infect- 
ing virus could be used to make many copies 
of a bacterial cell’s genome’. Woyke wanted to 
use the tools to fill out the microbial tree of life. 

She and her group collected samples from 
nine different habitats, including sediment 
from a Nevada hot spring and water near a 
Pacific hydrothermal vent. They isolated some 
200 cells, sequenced the genome of each one 
and classified the cells into more than 20 new 
lineages that do not have any cultivated repre- 
sentatives®. “They were the first to really take 
single-cell genomics to the next level, in terms 
of the number of sequences and single cells ana- 
lysed? says Hentschel. 

Last year, Jorn Piel of the Swiss Federal Insti- 
tute of Technology in Zurich and his colleagues 
reported that they had used single-cell sequenc- 
ing and other techniques to identify uncultured 
bacteria in marine sponges’. These filter-feed- 
ing creatures have long been of interest to scien- 
tists because they produce a rich set of chemicals 
with anticancer, antibiotic and other medicinal 
properties. They also harbour dense microbial 
communities that contribute up to 40% of the 
sponge’s mass and were suspected to be the 
source of these chemicals. But the members of 
those communities had not been cultured. 

Piel and his group focused on the sponge 
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“THEY WERE THE 
FIRST TO REALLY 
TAKE SINGLE-CELL 
GENOMICS TO THE 
NEXT LEVEL.” 


Theonella swinhoei, which harbours about 
1,000 types of bacterium and generates doz- 
ens of known bioactive compounds. In 2011, 
they started to sequence DNA from individual 
bacterial cells isolated from sponge samples 
and looked for two gene clusters known to 
be involved in the production of biologically 
active molecules. They found these genes in a 
bacterium called Entotheonella’. 

What was most surprising to Piel, however, 
was that this one organism was responsible for 
nearly all of the bioactive compounds linked 
to the sponge — something that became clear 
when sequence data showed that the bacterium 
harboured all the necessary genes. When Piel 
received the key data from his collaborators, “I 
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almost fell out of my chair,’ he says; it was the 
first evidence that an uncultivated microbe 
can be sucha ‘talented’ producer of bioactive 
chemicals. “The ability to create many distinct 
compounds in a single strain, this is not that 
common,’ he says. 

Piel’s lab is now trying to engineer gene clus- 
ters from Entotheonella into a culturable organ- 
ism such as Escherichia coli so that the host can 
churn out the compounds, something that is not 
likely to be easy given that biosynthetic genes 
can be large. He is also mining the genomes of 
microbes in sponges from Japan, Papua New 
Guinea and Israel in search of other bacterial 
super-producers. 


Michael Fischbach, a biochemist at the Univer- 
sity of California, San Francisco, has developed 
a different way to analyse microbial sequences: 
rather than isolating single cells, he sifts through 
the growing banks of bacterial genomic data. 

Fischbach and his group developed a 
machine-learning algorithm that is trained to 
recognize key patterns associated with bacterial 
genes that synthesize interesting molecules such 
as antibiotics. Then they let it loose on a large 
set of bacterial genomes to look for new gene 
clusters with similar features. 

Their targets included bacteria from soils and 
oceans, which are known to harbour an incred- 
ible diversity of microbes. But the algorithm 
generated a surprising number of hits from 
microbes that live on or in the human body, 
known collectively as the human microbiota. 
Fischbach was both excited and intimidated 
when he got the first results. Excited, because 
the bioactive compounds made by the micro- 
biota were largely uncharted territory, and could 
have important roles in human health and dis- 
ease. Intimidated because Fischbach mostly 
worked on microorganisms in soil. 

Still, Fischbach decided to take the plunge, 
and has since switched his entire lab’s focus to 
the human microbiome. Using an improved 
version of the algorithm, his team went pros- 
pecting in the genomes of almost 2,500 organ- 
isms in the human body”, and found more than 
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14,000 biosynthetic gene clusters. “It has been 
remarkably easy to find interesting molecules 
from the human microbiome,’ says Fischbach. 
“It's so easy that I can tell it has nothing to do 
with our ability to find them, it’s more that 
there’s just a lot to find” 

The group narrowed down the list to more 
than 3,000 common gene clusters, and found 
that one generated an antibiotic, lactocillin, 
made by a microbe commonly found in the 
vagina. It is one of only a handful of bioactive 
chemicals isolated from the human microbiota. 
Lactocillin blocks the growth of common vagi- 
nal pathogens such as S. aureus, but not of other 
bacteria that normally inhabit the vagina. Fisch- 
bach is now generating the molecules from the 
gene clusters he is finding, solving their struc- 
tures, and working with collaborators to learn 
more about their function. 

The natural-products field has tended to 
focus on soil and marine microorganisms 
rather than on human ones, says Gerry Wright, 
a chemical biologist at McMaster University 
in Hamilton, Canada. “I think it’s a great idea 
to look at those genes and clusters,” he says. 
However, turning such compounds into usable 
drugs will require a lot of preclinical work, 
says Wright. “Just by looking at a molecule, it’s 
almost impossible to tell whether it’s going to be 
suitable as a drug.” And even ifit seems promis- 
ing, the barriers to commercializing a new anti- 
biotic are high (see M. Woolhouse and J. Farrar 
Nature 509, 555-557; 2014). 

But Lewis takes hope from the recent pro- 
gress. With all the burgeoning efforts to culture 
and analyse uncultured microbes, he is already 
imagining what could be discovered if such 
efforts were scaled up. He hopes to see a level 
of drug discovery to match that in the Waks- 
man era, the time in the 1940s and 1950s when 
Nobel-prizewinning microbiologist Selman 
Waksman discovered more than 20 antibiotics 
by systematically screening thousands of soil 
microbes for their ability to block the growth 
of other bacteria. 

“The fact that we're finding compounds with 
such remarkably interesting modes of action 
that we haven't seen before, that’s the most 
interesting part,’ says Lewis. “We've scratched 
only a tiny part of Mother Earth” a 


Corie Lok is Nature’ Research Highlights 
editor. 
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A wave of innovative flat materials is following in the 
wake of graphene — but the most exciting applications 
could come from stacking them into 3D devices. 


BY ELIZABETH GIBNEY 


hysicists have used almost every superlative they can think of to 

describe graphene. This gossamer, one-atom-thick sheet of carbon 

is flexible, transparent, stronger than steel, more conductive than 

copper and so thin that it is effectively two-dimensional (2D). No 
sooner was it isolated in 2004 than it became an obsession for research- 
ers around the world. 

But not for Andras Kis. As miraculous as graphene was, says Kis, 
“T felt there had to be more than carbon.” So in 2008, when he got the 
chance to start his own research group in nanoscale electronics at the 
Swiss Federal Institute of Technology in Lausanne (EPFL), Kis focused 
his efforts on a class of super-flat materials that had been languishing 
in graphene’s shadow. 

These materials had an ungainly name — transition-metal dichal- 
cogenides (TMDCs) — but a 2D form that was quite simple. A single 
sheet of transition-metal atoms such as molybdenum or tungsten was 
sandwiched between equally thin layers of chalcogens: elements, such 
as sulfur and selenium, that lie below oxygen in the periodic table. 
TMDCs were almost as thin, transparent and flexible as graphene, 


says Kis, but “somehow they got a reputation as not that interesting. I 
thought they deserved a second chance.” 

He was right. Work by his team and a handful of others soon showed 
that different combinations of the basic ingredients could produce 
TMDCs with a wide range of electronic and optical properties. Unlike 
graphene, for example, many TMDCs are semiconductors, meaning that 
they have the potential to be made into molecular-scale digital processors 
that are much more energy efficient than anything possible with silicon. 

Within a few years, laboratories around the world had joined the 2D 
quest. “At first it was one, then two or three, and suddenly it became 
whole zoo of 2D materials,’ says Kis. From a scattering of publications in 
2008, 2D TMDCs alone now generate six publications each day. Physi- 
cists think that there may be around 500 2D materials, including not 
just graphene and TMDCs, but also single layers of metal oxides, and 
single-element materials such as silicene and phosphorene. “If you want 
a 2D material with a given set of properties,’ says Jonathan Coleman, a 
physicist at Trinity College Dublin, “you will find one” 

Ironically, one of the most exciting frontiers in 2D materials is 
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SOURCE: H. TERRONES ETAL. SCI. REP. 3, 1549 (2013) 


Stacks of multiple kinds of flat 
materials can exploit the best 
properties of each. 


stacking them into structures that 
are still very thin, but definitely 3D. 
By taking advantage of the vastly dif- 
ferent properties of various super-flat 
materials, it should be possible to build entire digital circuits out of 
atomically thick components, creating previously unimagined devices. 
Applications are already being touted in fields from energy harvesting to 
quantum communications — even though physicists are just beginning 
to learn the materials’ potential. 

“Each one is like a Lego brick,” says Kis. “If you put them together, 
maybe you can build something completely new.” 


ADVENTURES IN FLATLAND 

A material that is just a few atoms thick can have very different funda- 
mental properties from a material made of the same molecules in solid 
form. “Even if the bulk material is an old one, if you can get it into 2D 
form it opens up new opportunities,” says Yuanbo Zhang, an experimen- 
tal condensed-matter physicist at Fudan University in Shanghai, China. 

Carbon is the classic example, as physicists Andre Geim and 
Konstantin Novoselov found in 2004 when they first reported isolat- 
ing graphene’ in their laboratory at the University of Manchester, UK. 
Their technique was almost absurdly simple. The basic step is to press 
a strip of sticky tape onto a flake of graphite, then peel it off, bringing 
with it a few of the atom-thick layers that make up the bulk material. 
By repeating this process until they had single layers — which many 
theorists had said could not exist in isolation — Geim and Novoselov 
were able to start investigating graphene’s remarkable properties. Their 
work won them the 2010 Nobel Prize in Physics. 

Physicists were soon hurrying to exploit those properties for appli- 
cations ranging from flexible screens to energy storage (see page 268). 
Unfortunately, graphene proved to be a poor fit for digital electronics. 
The ideal material for that application is a semiconductor — a material 
that does not conduct electricity unless its electrons are boosted with a 
certain amount of energy from heat, light or an 
external voltage. The amount of energy needed 
varies with the material, and is known as the 
band gap. Turning the material's conductivity 
on and off creates the 1s and 0s of the digital 
world. But graphene in its pure form does not 
have a band gap — it conducts all the time. 

Still, Geim and Novoselov’s success in 
making graphene spurred them, Kis and many 
others to start investigating alternative 2D 
materials that could have a band gaps’. They 
began with TMDCs, which had been studied 
in bulk form since the 1960s. By 2010, Kis’s team had built its first sin- 
gle-layer transistor’ from the TMDC molybdenum disulfide (MoS,; 
see ‘Flat-pack assembly’), speculating that such devices could one day 
offer flexible electronics whose small component size and low voltage 
requirements would mean that they consumed much less power than 
conventional silicon transistors. Being semiconducting was not their 
only advantage. Studies in 2010 showed that MoS, could both absorb 
and emit light** efficiently, making it — and probably other TMDCs — 
attractive for use in solar cells and photodetectors. 

A single layer of TMDCs can capture more than 10% of incom- 
ing photons, an incredible figure for a material three atoms thick, says 
Bernhard Urbaszek, a physicist at the Physics and Chemistry of Nano- 
Objects Laboratory in Toulouse, France. This also helps them in another 
task: converting light into electricity. When an incoming photon hits the 
three-layer crystal, it boosts an electron past the band gap, allowing it 
to move through an external circuit. Each freed electron leaves behind 
akind of empty space in the crystal — a positively charged ‘hole’ where 
an electron ought to be. Apply a voltage, and these holes and electrons 
circulate in opposite directions to produce a net flow of electric current. 

This process can also be reversed to turn electricity into light. If elec- 
trons and holes are injected into the TMDC from an outside circuit; 


“PEOPLE SAID 
IT WAS INSANE 
AND WOULD 
NEVER WORK.” 
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when they meet, they recombine and give up their energy as photons. 

This ability to convert light to electricity and vice versa makes TMDCs 
promising candidates for applications that involve transmitting informa- 
tion using light, as well as for use in tiny, low-power light sources and even 
lasers. This year, four different teams demonstrated the ultimate control 
over light emission, showing that the TMDC tungsten diselenide (WSe,) 
could absorb and release individual photons®’. Quantum cryptography 
and communications, which encode information in one photon ata time, 
need emitters like this, where you “press a button and get a photon now, 
says Urbaszek. Existing single-photon emitters are often made of bulk 
semiconductors, but 2D materials could prove smaller and easier to inte- 
grate with other devices. Their emitters are necessarily on the surface, 
which could also make them more efficient and easier to control. 

Even as researchers were getting to grips with TMDCs, theorists were 
seeking other materials that could be engineered in two dimensions. 
One obvious candidate was silicon, which sits right below carbon in the 
periodic table, forms chemical bonds ina similar way, has a natural band 
gap and is already widely used in the electronics industry. Calculations 
suggested that, unlike graphene, a sheet of atomically thick silicon would 
have a ridged structure that could be squashed and stretched to create 
a tunable band gap. But like graphene, this ‘silicene’ would be a much 
faster conductor of electrons than most TMDCs. 

Unfortunately, theory also suggested that a 2D sheet of silicene would 
be highly reactive and completely unstable in air. Nor could it be ripped 
from a crystal like other 2D materials: natural silicon exists only ina 
3D form analogous to a diamond crystal, with nothing resembling the 
layered sheets of carbon found in graphite. 

“People said it was insane and would never work,” says Guy Le Lay, a 
physicist at Aix-Marseille University in France. But Le Lay, who had been 
growing metals on silicon surfaces for years, saw a way to make silicene by 
doing the reverse — growing atomically thin sheets of silicon on metal. 
And in 2012 he reported success’®: he had grown layers of silicene on 
silver, which has an atomic structure that matches the 2D material per- 
fectly (see Nature 495, 152-153; 2013). 

Buoyed by that effort, Le Lay and others have 
since moved down the carbon column of the 
periodic table. Last year, he demonstrated a 
similar technique to grow a 2D mesh of ger- 
manium atoms — germanene — ona substrate 
of gold’. His next target is stanene: a 2D lat- 
tice of tin atoms. Stanene should have a band 
gap larger than either silicene and germanene, 
which would allow its devices to work at higher 
temperatures and voltages. And it is predicted 
to carry charges only on its outside edges, so 
it should conduct with super efficiency. But Le Lay has competition. 
Although no one has yet reported growing stanene successfully, research 
groups in China are rumoured to be close. 


ELEMENTAL SHIFT 
Others are exploring different parts of the periodic table. Zhang's team 
and another led by Peide Ye at Purdue University in West Lafayette, Indi- 
ana, last year described’*”’ stripping 2D layers of phosphorene from black 
phosphorus, a bulk form of the element that has been studied for a cen- 
tury. Like graphene, phosphorene conducts electrons swiftly. But unlike 
graphene, it has a natural band gap — and it is more stable than silicene. 
Phosphorene has enjoyed a meteoric rise. At the 2013 meeting of the 
American Physical Society, it was the subject ofa single talk by members 
of Zhang's group; by 2015, the meeting had three entire sessions devoted 
to it. But like its fellow pure-element 2D materials, phosphorene reacts 
very strongly with oxygen and water. Ifit is to last longer than a few hours, 
it needs to be sandwiched between layers of other materials. This natu- 
ral instability makes fabricating devices with the ‘enes’ difficult; Le Lay 
estimates that around 80% of the papers about them are still theoretical. 
Nonetheless, both Zhang and Ye succeeded in making phosphorene 
transistors. This year, the first transistor from silicene emerged”, 
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FLAT-PACK ASSEMBLY 


Transition-metal dichalcogenide (TMDC) crystals contain one transition-metal 
atom (green) for every two chalcogen atoms (orange). Some 40 such TMDCs — 
mostly those made with the metals highlighted in dark green in the periodic 
table — can be split into 2D layers that are flexible, transparent and excellent 
conductors of electricity. Some are also semiconductors. 
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although it survived for only a few minutes. Still, Le Lay is optimistic 
that these issues are not insurmountable. Just two years ago, he points 
out, Geim and other physicists were saying that a silicene transistor 
could not be made with current technology. “So it’s always dangerous 
to predict the future,’ laughs Le Lay. 


THE NEXT DIMENSION 

Even as some physicists search for new 2D materials and try to under- 
stand their properties, others are already sandwiching them together. 
“Instead of trying to pick one and say this is the best, maybe the best 
thing to do is to combine them in such a way that all their different 
advantages are properly utilized,” says Kis. 

This could mean stacking components made of different 2D materials 
to make tiny, dense 3D circuits. Materials could also be layered inside 
components — something that chip designers already do when they 
grow layers of different semiconductors on top of one another to make 
devices such as the lasers inside DVD players. In standard devices, this is 
tricky: each layer has to chemically bond with the next, and only certain 
combinations can be matched. Otherwise, the strain between the dif- 
ferent crystal lattices in each layer makes bonding impossible. With 2D 
materials, that problem goes away: the atoms in each layer bond only 
very weakly to the neighbouring sheets, so the strain is minimal. Many 
layers of semiconductors, insulators and conductors can be stacked 
to form complex devices known generically as van der Waals hetero- 
structures, after the weak bonds that bind the layers. 

Already, for example, graphene has been used alongside MoS2 and 
WSe2 to create the junctions at the heart of solar cells'* and photo- 
detectors'®, exploiting the semiconductor’s abilities to absorb photons 
and graphene’s swift ability to carry the freed electrons away. In Febru- 
ary this year, Novoselov and his team reversed the solar-cell concept to 
make a light-emitting diode’’ from MoS, and other TMDCs between 
graphene electrodes. By selecting different TMDCs, the team could 
choose the wavelength of the photons released. 

Better still, sandwiching together different 2D layers can allow physi- 
cists to fine-tune their devices. Although the bonds between layers are 
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weak, the close proximity of their atoms means that they can affect 
each other’s properties in subtle ways, says Wang Yao, a physicist at 
the University of Hong Kong. Stacking order, spacing and orientation 
all control device behaviour. “Modelling this gives theorists like me a 
headache, but the new physical properties are definitely there,’ says Yao. 

Even graphene can get a leg up from other 2D materials, says Marco 
Polini, a physicist at the National Enterprise for Nanoscience and Nano- 
technology (NEST) in Pisa, Italy. His team has been working on devices 
in which graphene is sandwiched between 2D layers of the insulator 
boron nitride'’. When laser light is focused on the device, it gets com- 
pressed and channelled through the graphene layer, much more than in 
devices that sandwich graphene between bulk materials. In principle, 
this could provide a way for information to be carried between chips 
using photons rather than electricity. That, says Polini, could mean 
faster and more efficient communications within the chips. 


PRACTICAL PREDICTIONS 
The current buzz around 2D materials is reminiscent of the excitement 
about graphene in 2005, says physicist Jari Kinaret of Chalmers Univer- 
sity of Technology in Gothenburg, Sweden, who heads the European 
Union's Graphene Flagship — a programme that also studies other 2D 
materials. But Kinaret cautions that it could take two decades to really 
assess the potential of these materials. “The initial studies on 2D materi- 
als are focusing a lot on their electronic properties, because these are close 
to physicists’ hearts,’ says Kinaret. “But I think that the applications, if 
and when they come, are likely to be in a completely unpredicted area.” 

Materials that look good in the lab are not always those that make it 
out into the real world. One major issue facing all 2D materials is how 
to produce uniform, defect-free layers cheaply. The sticky-tape method 
works well for TMDCs and phosphorene, but is too time-consuming to 
scale up. It is also expensive to make bulk black phosphorus, because 
it involves subjecting naturally occurring white phosphorus to extreme 
pressure. No one has yet perfected the process of growing single sheets of 
2D materials from scratch, let alone the layered structures that physicists 
find so promising. “It took a long time to make our heterostructures,’ says 
Xiaodong Xu, a physicist at the University of Washington in Seattle. “How 
can we speed that up or make it automatic? There is a lot of work to do” 

Such practical considerations could prevent 2D materials from living 
up to their early promise. “There have been many rushes like this, and 
some have turned out to be fads,’ says Kis. “But I think the sheer number 
of materials and different properties should make sure something comes 
out of this.” Meanwhile, the zoo is expanding, says Coleman. Arsenene, 
a heavier cousin to phosphorene, is already on researcher's minds. 

“As people start to branch out, they are discovering new materials that 
have these wonderful properties,’ says Coleman. “The most exciting 2D 
material is probably one that hasn't been made yet.” mSEE BOOKS & ARTS P.284 


Elizabeth Gibney is a reporter for Nature in London. 
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This year’s deadly earthquakes in Nepal killed more than 8,000 people and reduced thousands of buildings to rubble. 


Pool knowledge to stem 
losses from disasters 


Public awareness, rigorous risk research and aligned targets will help policy-makers 
to increase resilience against natural hazards, say Susan L. Cutter and colleagues. 


quakes in Nepal killed more than 

8,400 people, injured 20,000 and reduced 
300,000 houses to rubble. In March, Cyclone 
Pam destroyed homes, schools, infrastruc- 
ture and livelihoods on the Pacific island 
of Vanuatu, affecting half the population, 
including 82,000 children. Both nations will 
take years to recover. 

The number and severity of disasters is 
increasing (see ‘Catastrophic rise’). Annual 
global economic losses from geophysical, 
hydro-meteorological and climatological 
events could almost double from their 2005 
levels by 2030 to exceed US$300 billion if the 
past decade’s trend continues. The figures 


I: April and May, two massive earth- 


may worsen as climate change, globalization, 
technological change, urbanization and 
political and economic instability put more 
people and assets at risk. 

Improved disaster-risk management and 
resilience is essential for sustainable socie- 
ties’, But the science of natural hazards is too 
fragmented to influence policy effectively. 
Seismologists, for example, had long warned 
in specialist journals that Nepal’s Kathmandu 
region was due a large earthquake. Local poli- 
ticians did not strengthen construction codes, 
reinforce old buildings or inform the popula- 
tion about potential risks. Had such measures 
been implemented — as they have in Japan, 
California and Chile — the death toll would 


© 2015 Macmillan Publishers Limited. All rights reserved 


have been lower (see “Three lessons yet to be 
learned’). Similarly, structures in flood-prone 
areas can be elevated; those in cyclone zones 
wind-proofed; and the public educated about 
such possibilities. 

Sadly, hazard mitigation is not a vote- 
winner. It pits long-range investments against 
short-term political cycles — even though it 
is cheaper to prevent losses than to rebuild 
after them*”. Reinforcing the levees of New 
Orleans, Louisiana, against hurricane storm 
surges would have cost ten times less than 
rebuilding neighbourhoods after Hurricane 
Katrina. It is more politically expedient to 
respond afterwards when constituents are 
demanding assistance. Public awareness > 
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> of the scale of disaster risks is hindered by 
the breadth and complexity of research, span- 
ning the natural, social and health sciences, 
law, humanities and engineering. 

In March, governments met under the aus- 
pices of the United Nations in Sendai, Japan, 
to negotiate an international agreement to 
reverse the rising trend of disaster losses. 
Unlike previous voluntary agreements, the 
Sendai Framework for Disaster Risk Reduc- 
tion 2015-2030 has set measureable targets. 
One goal is to lower average death rates 
and economic losses in 2020-30 relative to 
2005-15 (by what percentage is not specified). 

For the Sendai agreement to succeed, an 
open and comprehensive source of vetted 
information on disaster-risk reduction is 
needed. It would provide evidence for moni- 
toring progress towards the goals. We call on 
the scientific community to set up an inter- 
national assessment process to feed such 
information into disaster policy and practice. 


SPLINTERED APPROACH 
The community of disaster-risk researchers 
is small and splintered into disciplines that 
are focused on single natural hazards. Only 
recently have seismologists worked with 
geodesists to determine how changes in 
Earth's shape and gravity field apply loads to 
faults’. Disaster-medicine researchers rarely 
meet public-health professionals or read 
social-science journals’. Local or indigenous 
knowledge and the on-the-ground experi- 
ences of emergency managers and humani- 
tarian agencies are often excluded. 
Governments need holistic solutions, not 
incremental proposals that solve one aspect 
but ignore wider ramifications. A move 
towards integrated disaster-risk research — 
bringing together disciplines to focus on par- 
ticular problems and social needs — is filling 
this void®. But lack of a critical overview of 
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The number of disasters caused by natural events has more than doubled since 1980. 
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what is known about disaster risk leaves 
politicians without guidance. 

This is why early political efforts to reduce 
disaster losses foundered. The UN desig- 
nated the 1990s as the International Dec- 
ade for Natural Disaster Reduction, and in 
1999 created the UN Office for Disaster Risk 
Reduction (UNISDR) to coordinate national 
efforts. In the absence of a legally binding 
treaty with targets, and consequences for not 
achieving them, UNISDR has been largely 
limited to promoting risk awareness and 
facilitating institutional development. 

In 2005, some 168 countries signed up to 
the Hyogo Framework for Action (HFA), 
pledging to reduce disaster losses by 2015. 
Again, this did not occur. Of the ten costliest 
disasters between 1980 and 2014, those that 
happened after the agreement were more than 
four times as expensive as those in the decade 
before. Increased vulnerability and exposure 
account for some of the increase — but not all. 

At least 35 nations, including Colombia, 
Brazil and India, now include disaster risk 


HAZARD PREPARATION 


Three lessons yet to be learned 


Build well to save life and property. At least 
since the 1755 Great Lisbon earthquake, 
scientists and politicians have known 

that it is not earthquakes that kill people, 
but the resultant building collapse. Still, 
construction in hazard-prone regions 
frequently flouts standard building codes. 


Invest in pre-disaster mitigation. 
Spending 5-10% of the funds necessary 
for recovery after a disaster on mitigation 
saves lives and resources’. Yet few 

nations bother. The estimated cost of 
protecting New Orleans, Louisiana, against 
hurricanes and flooding, for example, was 
US$14 billion before Hurricane Katrina in 
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2005; the total losses actually incurred that 
year were $125 billion. 


Be prepared. Emergency-management 
plans should be practised long before a 
disaster occurs. Preparedness also entails 
warning systems, education, awareness 
raising and risk communication. Most poor 
countries do not do this. For example, when 
Typhoon Haiyan approached the Philippines 
in 2013, had the local emergency managers 
communicated to residents that a tsunami 
was coming — instead of issuing a warning 
using the scientifically correct term ‘storm 
surge’ — more locals would have evacuated 
coastal areas. 
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and reduction in their development strategies. 
Most do not. Many financial donors view 
disasters as interruptions in development, 
not as risks that need to be managed. Disas- 
ter-risk-assessment reports are too broad to 
guide municipalities and nations. For exam- 
ple, UNISDR’s Global Assessment Reports or 
the World Economic Forum's Global Risks 
reports provide global and thematic over- 
views, often consider risks qualitatively rather 
than quantitatively, and neglect the collective 
impacts of personal choices such as whether 
to purchase insurance or relocate’. 

The Sendai framework calls on govern- 
ments to do four things: understand disaster 
risk; strengthen risk governance to manage 
risks across all sectors; invest in risk-reduc- 
tion measures that promote resilience; 
and enhance disaster preparedness and 
responses so that nations “build back better” 
in their recovery. It tasks research networks 
with focusing on the root causes of disasters 
and probable emergent risks; supporting 
action by local communities and authori- 
ties; and engaging policy-makers. 


REVIEW MECHANISM 
A coherent science-based assessment process 
for disaster-risk reduction should be created 
to provide sound knowledge to inform 
decision-making, and to assist governments 
worldwide in setting policies and goals and to 
identify research gaps. By taking an integrated 
approach, such an assessment would go 
beyond previous proposals for international 
panels on natural hazards and disasters’. 
Disaster-risk reports should identify what 
is known and where there are gaps in our 
knowledge. They must summarize informa- 
tion relevant to the Sendai targets. And they 
should examine the root causes of vulner- 
abilities and exposure, the potential socio- 
economic impacts of natural hazards and the 
ways to reduce (if not prevent) human and 
economic losses. Finally, such an assessment 
should provide a mechanism for knowledge 
transfer from research to practice, ensuring 
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that the science is useful, usable and used’. 

A high-level, transdisciplinary body 
of international experts in disaster-risk 
reduction should be established by national 
governments and international organiza- 
tions dealing with disaster risks, with input 
from various sectors and civil society. Such 
a body would have the reach and influence 
— from local communities, businesses and 
governments — to raise people's awareness. 
The same findings presented by an inde- 
pendent scientist or article would not. 

The main practical difficulties will be in 
incorporating the field’s diverse information 
and practices into an assessment, and demon- 
strating to policy-makers that it need not take 
a extreme event to cause catastrophic human 
consequences”. Government support for 
the process will be essential. Synergies must 
be found by combining and consolidating 
disaster-risk reduction efforts across UN 
institutions. 

Disaster-risk management, climate 
change and sustainable development tar- 
gets will need to be aligned. For example, 
there should be a coordinated assessment of 
the state of knowledge in disaster risk and 
its utility for supporting the UN Sustainable 
Development Goals and the UN Framework 
Convention on Climate Change. 

Knowledge transfer will make community- 
based resilience efforts possible. Iuminating 
findings, best practices and state-of-the-art 
modelling must become part of the evidence- 
based strategy for disaster-risk reduction. m 
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Fossil-fuel factories are demolished in China’s Hebei province. 


Steps to China’s 
carbon peak 


Regional targets and improved market mechanisms 
could enable the nation’s carbon dioxide emissions to 
peak by 2030, say Zhu Liu and colleagues. 


the global total of carbon dioxide for 

the year, and 1.5 times that released 
by the United States. It is the world’s lead- 
ing emitter of the gas. Without mitiga- 
tion, China’s CO, emissions will rise by 
more than 50% in the next 15 years. Last 
November, in a joint announcement with 
the United States, China pledged that its 
CO, emissions will peak by 2030. Exist- 
ing policies are insufficient to make that 
happen. 

China has a two-pronged strategy for 
tackling climate change — increasing 
energy efficiency in manufacturing, and 
controlling regional emissions through tar- 
gets. This approach has been effective: we 
calculate that CO, emissions per unit gross 


I n 2013, China released one-quarter of 
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domestic product (GDP) fell by 28.5% 
between 2005 and 2013. This is equivalent 
to a saving of more than 2.5 gigatonnes of 
CO, (half of US total emissions). A fledg- 
ling domestic carbon-emissions-trading 
scheme (ETS) has slowed CO, emissions 
growth by 2-3% in the seven provinces or 
cities in which it has been piloted. 

But rapid economic growth, espe- 
cially the building of infrastructure, has 
more than offset these reductions. Total 
national CO, emissions grew by 7% each 
year between 2005 and 2013, although 
growth of emissions is slowing in cities 
such as Beijing and Shanghai as advanced 
technologies and cleaner energy sources 
are increasingly being used. 

Curbing this growth will needa > 
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> decrease in projected emissions between 
2015 and 2035 that is almost equivalent to the 
total global CO, emissions in 2013 — 30 giga- 
tonnes (see ‘Carbon crunch). This challeng- 
ing goal is within reach if China keeps its 
annual growth in emissions below 2%, rolls 
out a national carbon-trading system and 
obtains 30% of its energy from renewables 
and natural gas by around 2035. China’s per 
capita emissions in 2030 would then mirror 
those of the European Union (EU) in 2013 (a 
little under 8 tonnes of CO, per person). 

We propose four steps for China to hit this 
ambitious target. It must strengthen regional 
emissions targets; improve the reporting and 
verification of emissions data; enhance the 
regulation and supervision of a nationwide 
emissions-trading market; and incentivize 
the uptake of green technologies, especially 
in underdeveloped regions. 


Set regional emissions targets. A single 
target will not be suitable for all of China's 
30 provinces and autonomous regions, 
which have different energy-source mixes, 
uses and economic development needs. 
Peak emissions targets need to be designed 
for each region such that the national total 
falls by 2030. 

Wealthy cities such as Beijing, Tianjin and 
Shanghai, which have reversed emissions 
growth since 2011, could cap their emissions 
and set and adjust reduction targets at five- 
year intervals. Developed coastal provinces, 
such as Guangdong, Zhejiang and Jiangsu, 
that have benefited from the pilot ETS since 
2010 should pledge more-aggressive and 
earlier peak targets — for example, reduc- 
ing carbon emissions per capita to below 
those of the EU before 2030 or even 2020. 
Emissions in underdeveloped regions such 
as Shanxi province and the Ningxia and Xin- 
jiang autonomous regions could be allowed 
to peak after 2030 to leave room for more 
infrastructure construction. 

Fossil-fuel use needs to be curtailed first. 
The country’s ambitious plan to cap national 
coal consumption in 2015 at 3.9 gigatonnes 
must be met. Beijing and Tianjin are 
required to halve their coal consumption by 
2017 to meet mandated air-quality require- 
ments. All conventional coal-fired plants in 
Beijing will be converted to gas by the end of 
2015 and an extra 13 billion cubic metres of 
natural gas from western China and 12 bil- 
lion cubic metres from other countries are 
being piped to Beijing and the surrounding 
cities. Smaller cities and other regions, too, 
must reduce their coal use. 

China can strengthen technology-driven 
improvements in emission intensity (CO, 
emissions per unit of GDP), especially in the 
less developed regions. But loopholes also 
need to be closed. For instance, two-thirds 
of regions achieved their 2002-09 intensity- 
reduction targets by enlarging the scale of 
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production’, which increased national emis- 
sions by 50% (ref. 2). To end this practice, 
emission-intensity targets should be sup- 
plemented by indicators of physical emis- 
sions efficiency, for example, emissions per 
unit of steel production. Such indicators are 
easier to monitor and verify and can be used 
to measure the efficiency of sectors, as well 
as of individual factories. 

Air-quality indicators, such as particu- 
late concentrations, should be integrated 
with emissions targets. Duplicate invest- 
ments and work plans are common, with 
China's National Development and Reform 
Commission (NDRC) in charge of climate- 
change mitigation and the Ministry of 
Environment responsible for air-pollution 
abatement. Better coordination between 
departments and a set of joint emissions- 
control measures — managed by one agency 
— are needed. 

Emissions targets must be decoupled from 
GDP. Criteria by which local and provincial 
leaders are evaluated for career promo- 
tion should include energy efficiency and 
pollution-abatement targets, rather than 
measures of economic development. Oth- 
erwise, greater growth will always trump 
environmental sustainability. 


Ensure transparent emissions monitor- 
ing, reporting and verification. Data sets 
for China's total carbon emissions, especially 
those generated by regions and sectors, are 
not always reliable. Official national emis- 
sions totals have differed from self-reported 
provincial statistics by 20%. And in sectoral 
data sets, factories may overstate their emis- 
sions to make it easier for them to achieve 
reduction targets’. 


Also, the average car- “China 
boncontentofChinas Produced 
coal is not accurately 60% of the 
known, given the world’s solar 
variation in the qual- photovoltaic 
ity ofcoalusedacross cells in 2014, 
different regions. yet less than 
As afirst step, the 5% were 
Chinese Academy of installed 
Sciences is coordinat- domestically. i 


ing a national investi- 
gation of major emissions sources and sinks 
that should be complete by the end of this 
year’. This should be expanded to include 
private and small-scale enterprises, which 
tend to use inefficient and dirty technolo- 
gies. Small boilers, for example, consume 
about 10% of China’s coal and are one of the 
main sources of air pollution in Beijing. 
The compilation and reporting of local 
emissions data are coordinated by regional 
environmental-protection departments, 
which also issue environmental certifi- 
cates for firms and construction projects. 
This conflict of interest sometimes leads 
to data manipulation and bribery; each 
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year, hundreds of government officers are 
investigated. 

The government needs to set up an 
independent agency to monitor and verify 
emissions data for regions, sectors and 
companies. The regulator would establish 
baseline emissions inventories and verify 
reductions through spot inspections, mod- 
elling and the installation of tamper-proof 
monitoring equipment. 

Monitoring by third parties — the public, 
the media and non-governmental organi- 
zations (NGOs) — needs to be encouraged. 
Environmental laws and regulations must 
offer citizens the right to know emissions 
figures and the right to sue an agency or 
company if information has been concealed 
or manipulated. 

State-owned projects should make infor- 
mation available to the public by publishing 
it online, as is done with Beijing's air-quality 
data. More web portals should be established 
to show the improvement, or lack ofit, in pol- 
lution reduction over time. Examples include 
the Air Quality Index map (see go.nature. 
com/82gvyn), run by a Beijing-based NGO, 
and the pollution map produced by the 
non-profit Institute of Public and Environ- 
mental Affairs in Beijing, which displays 
real-time emissions information for more 
than 6,000 large-scale enterprises in China 
(see go.nature.com/vb6uht). Such efforts will 
enable government regulators and citizens’ 
groups to ‘name and shame’ companies that 
do not adhere to environmental regulations. 


Expand carbon trading nationwide. Chi- 
nas pilot ETS traded nearly 14 megatonnes 
of CO, in 2013. This is less than 0.5% of the 
national total, but the scheme is still the 
second largest in the world, after that of the 
EU. Starting in 2016-17 (ref. 5), the NDRC 
intends to extend the scheme to all 30 Chi- 
nese provinces, making China the world’s 
largest carbon market. In the pilot ETS, 
each participant set its own cap and decided 
which sectors it would cover. For example, 
transport is included only in Shanghai’s ETS. 
Each also determined how compliance is 
measured. Hubei province targets enter- 
prises consuming more than 60,000 tonnes 
of coal-equivalent per year, a threshold six 
times that of Guangdong. 

In the expanded scheme, a national 
emissions cap must be set and central gov- 
ernment should develop standards and 
a timeline to unify cap criteria across the 
country. The first step should be to introduce 
a national ETS for the top six CO,-emitting 
industries — power generation, ferrous 
and non-ferrous metallurgy, construction, 
chemicals production and aviation services 
— to be extended later to others. Regulations 
and laws are needed that mandate emissions 
caps for certain enterprises and encourage 
emissions trading. Accounting and tax 


CARBON CRUNCH 


China’s carbon dioxide emissions will grow beyond the 


year 2030 unless it adopts strict low-carbon measures. 


Greater production efficiency, use of renewable energies and natural gas, and nationwide emissions-trading schemes 
can allow emissions to peak by 2030, and reduce national CO, emissions by 30 gigatonnes (Gt) by 2035. 
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treatments need to be clarified. 

A cascading management framework of 
carbon trading needs to be established, with 
provincial ETS exchanges linked to higher- 
level, regional exchanges. For example, Bei- 
jing’s exchange could coordinate provincial 
exchanges across northern China. 

Market mechanisms should also be intro- 
duced to the energy-supply system. First, 
the monopoly of electricity and energy 
supply by major state-owned companies 
must be broken to encourage innovation 
and improved efficiency. PetroChina and 
Sinopec, the two main state-owned oil 
companies, together account for 80% of the 
national oil supply and 10% of the nation’s 
total carbon emissions. 

Second, an energy-supply market needs 
to be set up to allow prices to respond to 
demand and incentives. The price for energy 
delivered to the grid is now controlled by the 
government and is largely static. This creates 
barriers to the connection of power from 
renewable sources: low-carbon electricity is 
sold at a higher price than electricity gen- 
erated from fossil fuels. Subsidies for fossil 
fuels, such as the contract price of coal sold 
to power plants, need to be removed so that 
renewables and low-carbon energy tech- 
nology can be competitive. A carbon tax — 
starting low and ratcheting up over the next 
decade — on certain emissions-intensive 
sectors, such as transport, that are suscepti- 
ble to a change in behaviour would slow the 
growth in fossil-fuel consumption. 


Advance green technology in under- 
developed areas. Government incen- 
tives have stimulated double-digit growth 


Best-practice 
low-carbon scenario 


Best practices 


of CO,. 
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in renewable-energy technologies in the 
past decade. The ‘green-energy’ industry 
will contribute 2% of GDP (45,000 billion 
yuan) to the economy by the end of 2015. 
Yet the nation’s focus is still on manufac- 
turing rather than implementation. China 
manufactured wind-power capacity equiva- 
lent to 114,763 megawatts in 2014 (half of 
the world’s added capacity for the year), 
but only one-third of that operates in the 
country. China also produced 60% of the 
world’s solar photovoltaic cells in 2014, yet 
less than 5% of these panels were installed 
domestically. 

Incentives should be targeted to domes- 
tic consumers of renewable energy and 
eco-products, instead of encouraging pro- 
duction for the global market. Low-carbon 
technologies should be targeted especially 
at power production in China’s underdevel- 
oped and poor regions. These regions pro- 
vide power and goods for others and stand 
to gain most from generation efficiency and 
development. 

In 2010, for example, the emission factor 
(CO, emissions per kilowatt-hour of elec- 
tricity generated) of coal-fired thermal 
power plants in Inner Mongolia was 20% 
higher than that of power plants in Guang- 
dong on the south coast’. Yet one-third of 
the power generated by Inner Mongolia is 
exported to the east-coast regions®. Poor 
central and western provinces should 
instead exploit their vast land areas for solar 
and wind farms. 

Regional grid construction and integrated 
technologies should be encouraged. China is 
a world leader in ultra-high-voltage trans- 
mission technology. This will be essential 
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for linking areas generating renewable 
electricity, mainly in the west, to the large 
coastal cities. Initiatives to develop ‘smart’ 
electricity grids and high-speed connec- 
tions would save energy through improving 
electricity distribution. Mass-transit systems 
and regional rail expansion would similarly 
reduce emissions from road traffic and 
construction. 

We call on China's leaders to be bold 
and create a low-carbon economy that will 
benefit the country and the globe. m 
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Plants regulate important processes such as photosynthesis through pores called stomata. 


PLANT SCIENCE 


Rediscovering the 
bush telegraph 


Jan T. Baldwin assesses three books on the rich array of 
plant behaviours, from sensing to communication. 


r Ahe food writer Michael Pollan, author 
of The Omnivore's Dilemma (Penguin, 
2006) among others, wrote an article 
in The New Yorker in 2013 exploring why 
terms such as intelligence, memory and even 


behaviour have been contentious for plant 
scientists. His thesis boils down to a divide in 
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biology that allows zoologists to use anthro- 
pomorphic terms, but denies the privilege 
to plant scientists. Pollan allies himself with 
a small band of intrepid researchers crusad- 
ing against the “cerebrocentric” view that 
permits behaviour only to organisms with 
brains. He tells of collateral damage from 
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Leaf Defence 
EDWARD E. FARMER 
Oxford Univ. Press: 2014. 


Plant Sensing and Communication 
RICHARD KARBAN 
Univ. Chicago Press: 2015. 


Plant Behaviour and Intelligence 
ANTHONY TREWAVAS 
Oxford Univ. Press: 2014. 


sensationalist treatments that exaggerate 
plant-science findings, and of glimmers of 
a new sensitivity towards all life. 

Pollan identifies an interesting story 
about the development of an emerging 
scientific field, and the baggage that scien- 
tists bring to their work. The idea that plants 
are ‘smarter’ than their immobility suggests 
is now supported by rigorous experimenta- 
tion and fieldwork that are uncovering the 
genes and chemicals that mediate plants’ 
environmental intelligence. We know now 
that much ofa plant’s rich behavioural rep- 
ertoire is hard to observe because it is played 
out in a chemical arena. Plants overcome the 
constraints of immobility mainly by har- 
nessing their prowess as synthetic organic 
chemists. For instance, floral scents contain 
compounds that attract pollinating ani- 
mals and repel flower-eating ones. Nectar 
is a brew of nutrients and toxins that opti- 
mize the behaviour of pollinators. Much of 
the relevant literature is now synthesized 
in three books by leading researchers in 
the field: Edward Farmer’s Leaf Defence, 
Anthony Trewavas'’s Plant Behaviour and 
Intelligence and Richard Karban’s Plant Sens- 
ing and Communication. 


RESEARCH RENAISSANCE 
The research is essentially a rediscovery. 
Between the eighteenth and the twenti- 
eth centuries, Erasmus Darwin, Charles 
Darwin and botanists Wilhelm Pfeffer and 
Jagadish Chandra Bose published books 
that brim with observations on plant behav- 
iour. Charles Darwin's 1880 The Power of 
Movement in Plants, for instance, incorpo- 
rates insights from his experiments on how 
exposure to light drives bending in grasses, 
which laid the foundation for the discovery 
of the hormones responsible for plant move- 
ment. If they were alive today, these scientists 
would wonder what all the fuss is about. 
The study of plant defences was revived 
in 1959, when entomologist Gottfried 
Fraenkel published the landmark article “The 
raison détre of secondary plant substances’ 
(G. S. Fraenkel Science 129, 1466-1470; 
1959), building on original insights by nine- 
teenth-century botanist C. Ernst Stahl. 
Fraenkel argued that plants’ awesome 
diversity of secondary metabolites such as 
toxic alkaloids and aromatic terpenoids 
evolved primarily to thwart herbivores. 
He ushered in the modern era of chemical 
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ecology, including an exuberance of theory 
to explain these metabolites’ patterns of 
accumulation. Much of the theorizing came 
from zoologists, who applied paradigms 
from animal-behaviour studies to plants, 
often without deeper understanding of how 
plants function as organisms. The plant- 
biology community had little patience for 
this, being at the time focused on under- 
standing mechanisms of plant growth under 
controlled laboratory conditions, developing 
Arabidopsis as a model plant and embracing 
molecular biology. After reading Farmer, 
Karban and Trewavass books, I felt that the 
prevailing scientific opinion that plants are 
mere autotrophic growth machines has per- 
haps as much to do with the divorce between 
molecular and organismic approaches as 
with the botany-zoology divide. 

Leaf Defence considers why plants have or 
lack leaves, and why leaves evolved to have 
particular traits. Because leaves dominate 
most plants’ above-ground biomass, herbi- 
vores usually home in on them, a circum- 
stance that has shaped leaves’ chemistry, 
physics, morphology and development. 
Plants, meanwhile, eat sunlight with their 
leaves. So to fathom leaf evolution, it is impor- 
tant to understand both herbivores’ digestive 
tracts and the dynamics of solar radiation. 

Farmer works on jasmonates, a fatty-acid- 
based signalling system that activates many 
of plants’ defences against herbivores. Leaf 
Defence provides an excellent review of the 
rapidly evolving literature in this field. The 
only disappointment is that books do not 
evolve as rapidly even as the literature from 
the author’s own lab. Farmer's book was writ- 
ten before his laboratory reported a land- 
mark discovery: a molecular link between 
jasmonate signalling and electrical signalling 
induced by herbivore attack (S. A. R. Mousavi 
et al. Nature 500, 422-426; 2013). 

Farmer weaves his compelling narra- 
tive from hard-core molecular research 
on mechanisms of perception of herbivore 
attack and defence responses, to more specu- 
lative inferences about how leaf colour and 
escape to rocky outcrops help plants to sur- 
vive the relentless interests of herbivores. He 
highlights the lack of evidence for some sen- 
sationalist stories of plant-plant signalling, 
such as one about signalling between South 
African acacia trees about impending attacks 
from antelopes. And of the three books, Leaf 
Defence is the only one to include structures 
of plant chemicals. 

The plant ‘behaviour’ debate described 
by Pollan is the starting point for the other 
two books. In Plant Sensing and Commu- 
nication, Karban, a 
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respond, such as light and sound. Karban, 
who featured prominently in Pollan’s article, 
largely avoids political melodrama, instead 
providing clear working definitions of some 
of the contentious vocabulary: communica- 
tion, eavesdropping, learning and memory. 
His summaries of plants’ sensory abilities 
— such as the cues and signals that they use 
to adjust to the environment — retain an 
evolutionary perspective. And he branches 
out into areas such as mate choice, for exam- 
ple detailing how plants selectively breed 
with specific fathers represented in a mixed 
pollen load. 

Karban’s research interests overlap with 
mine; we even work on the same species (coy- 
ote tobacco, Nicotiana attenuata). Although 
he is a gifted writer and synthesizer, Karban’s 
review of the research on the mechanisms 
of plant perception and response is not well 
served by his lack of familiarity with the 
techniques that uncover them. Issues that 
he identifies as central unsolved problems 
(such as whether the volatile ‘alarm calls’ 
of plants protect against herbivores and 
increase fitness under real-world condi- 
tions) have been solved and published. The 
unrealistic experimental designs so common 
in the early literature are frequently reported 
uncritically. In discussing the complicated 
process of demonstrating an adaptation, 
Karban does not mention the genes respon- 
sible for the expression of traits such as shade 
avoidance, or the advantages offered in this 
area by cheap sequencing. So I recommend 
Plant Sensing and Communication to those 
who want a low-calorie introduction and a 
deeper consideration of the concepts, but the 
primary literature is better for understanding 
the mechanisms. 


BOTANICAL MECHANICS 
Trewavas, by contrast, moves effortlessly 
from mechanistic research to invigorating 
insights into real-world plant behaviour. 
Plant Behaviour and Intelligence is a wild 
ride, covering ground from the origins of life 
to intelligent nutrient-foraging behaviour 
in the roots of higher plants. Trewavas's five 
decades of research into plants’ molecular 
biology and physiology, and their evolution 
as self-organizing systems, make him fully 
‘phytomorphized’. He thinks like a plant, 
effortlessly calling on specific traits to look 
at how plants solve problems in similar ways 
to social insects — from siphonogamy (in 
which pollen tubes carry sperm cells to egg 
cells) to highly dispersed sensory systems. 
He celebrates behaviour in plants while 
avoiding “animal envy”. At 76, Trewavas may 
have written his last book — but it is one that 
encapsulates his life’s work beautifully. 
Plant Behaviour and Intelligence is also 
peppered with references to classics in the 
field, and takes inspiration from the geneti- 
cist Barbara McClintock's “feeling for the 
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Charles Darwin systematically investigated 
plant behaviour in the late nineteenth century. 


organism’, which she derived from her 
long study of maize (corn) and discovery of 
‘jumping genes, or transposons. Trewavas 
and Farmer share a deep understanding of 
how plants function, mechanistically and 
biochemically, and this anchors their forays 
into how plants work in the real world. This 
grounding in mechanism may prove to be the 
key to convincing a scientific community sen- 
sitized to the gap between populist hype and 
the data on plant behaviour. 

Plant biology is emerging from its 
Arabidopsis-in-the-laboratory phase. 
Researchers need to understand the natural- 
history context of such models before plant 
behaviours can be sensibly scrutinized in the 
functional (natural) context. The infusion 
of ideas from behavioural ecology between 
1950 and 1990 revived the field, but it now 
needs balance. The mental constructs that 
ecologists bring to experimentation on plant 
function, such as the cost-benefit paradigm 
applied to resource allocation, are of lim- 
ited value when they are unmoored from 
the mechanisms that anchor these phe- 
notypes. These three books suggest that 
the way forward may be through a return 
to the observational skills that allowed the 
great researchers of the twentieth century to 
become phytomorphized. = 


Ian T. Baldwin is the founding director 
of the Max Planck Institute for Chemical 
Ecology in Jena, Germany. 

e-mail: baldwin@ice.mpg.de 
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Cinzia Casiraghi (left) and Sara Lowes collaborated to create the Graphene Suite. 


Q&A Cinzia Casiraghi & Sara Lowes 
Maestros of graphene 


Composer Sara Lowes has teamed up with materials scientist Cinzia Casiraghi at the 
University of Manchester, UK. The result, Lowes’ six-part Graphene Suite, premieres next 
week at the Graphene Week 2015 conference in Manchester, part of the European Union's 
decade-long, €1-billion (US$1.1-billion) Graphene Flagship research programme. Lowes and 
Casiraghi talk crotchets, carbon chemistry and the commonalities between women in science 


and women in music. 


How did you come to work together? 

SL: The idea came from Brighter Sound, a 
cultural organization in Manchester that 
wanted to explore the common ground 
between the creative processes in music and 
in science, particularly graphene. To be given 
something I know nothing about and to use 
that to create a piece is fascinating. The brief 
was for a female musician to do this, because 
composition is a hugely male-dominated 
profession. 


CC: That is one of the similarities between 
science and composition: there are not 
enough good role models for women. The 
issues are the same, whether it is in science, 
music or any other field. I am a member 
of the University of Manchester school of 
chemistry’s Athena SWAN committee, 
which supports women in science, so I was 
very keen to collaborate with Sara. 


284 | NATURE | VOL 522 | 18 JUNE 2015 


Was it difficult to span 
the cultures of music 


Graphene Suite 
25-28 June 2015, 


and science? Manchester, UK. 
7 * . http://go.nature. 
SL: Composing music Sen 


is like science in that 

you start with an idea and ask, ‘I wonder 
what would happen if ...?’ Cinzia and I met 
for the first time in summer 2014. She talked 
about her job, we met her colleagues in the 
chemistry and physics departments, and 
she showed me around a clean room. I had 
never been in a university science depart- 
ment before, and I kept feeling frustrated 
that people outside science do not know any- 
thing about it. Science is part of everything 
around us; it is part of our world. 


CC: Collaboration is an important part of 
my research, which is often multidiscipli- 
nary. And although I work on graphene now, 
Icome from a completely different scientific 


© 2015 Macmillan Publishers Limited. All rights reserved 


background — nuclear engineering — so it 
is quite natural for me to find connections 
between fields that you might think do not 
have anything in common. Collaboration 
helps us to see problems from different 
points of view, to find solutions or develop 
projects in a new direction. 


Do you ever worry that graphene will not live 
up to expectations? 

CC: Yes. I started my research career on 
carbon nanostructures, and I saw a couple 
of those — nanotubes and fullerenes — just 
dying along the way. But graphene is a little 
bit different. Ten years ago we were dealing 
with tiny pieces of graphene, hardly visible 
under the microscope, so it was difficult and 
time-consuming to produce and process the 
material. Now we have centimetre-sized 
graphene sheets, graphene inks and many 
other types of 2D crystal. So lam more opti- 
mistic about graphene than other carbon 
nanostructures. 


How did you begin to shape the music? 

SL: The first thing I looked at was graphene’s 
molecular structure — the hexagonal honey- 
comb pattern — so I used the number six a 
lot in the music. It is a six-movement piece, 
the parts that are about graphene itself are in 
six-eight time, and I have tried to interpret 
the hexagon structure musically in one of the 
movements. 


And how will it work in performance? 

SL: Some of the audio will come from data 
on graphene that we have turned into waves, 
so we can play them as sound. Each move- 
ment is four to five minutes long. Some 
parts are orchestral; others will be played by 
my band. The classical element — a string 
quartet, with oboe and trumpet in some 
places — reflects the history of Manches- 
ter, its scientific achievements. The modern 
element — drums, bass, guitar and keyboard 
— represents the idea that this is a really for- 
ward-looking time, and that graphene is a 
wonderful discovery. The first movement is 
a journey representing ‘science at work’: the 
long hours, the commitment and the frustra- 
tions. The second movement goes on to the 
moment of discovery, the sheer excitement 
of it. The third is all the hard work that hap- 
pens once that discovery has taken place. 
Another is dedicated to women — it has a 
noble, keep-on-going attitude. 


What are your hopes for the Graphene Suite? 
SL: [hope that it will help to shine a spotlight 
on the spaces that are available for women 
in science and composition. We are both 
passionate about making sure that that is a 
legacy of the project. m 


INTERVIEW BY MARK PEPLOW 


This interview has been edited for length and clarity. 


JILL JENNINGS 
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Factor people into 
tiger conservation 


Tiger populations in India and 
Nepal have increased by almost 
one-third and two-thirds, 
respectively, over the past 5 years. 
But for these countries to achieve 
their goal of doubling their 
combined tiger numbers to more 
than 3,600 by 2022, they must 
take the human population into 
account as well. 

India is projected to become 
the world’s most populous nation 
by 2028, with around 1.45 billion 
people. This will place further 
pressure on tiger habitats such as 
those in Arunachal Pradesh and 
Uttarakhand, where hydropower 
projects are planned. 

Local village populations 
are unlikely to cooperate with 
tiger conservation efforts 
while the animals continue to 
take livestock and kill humans 
(N. H. Carter et al. Ambio 43, 
125-137; 2014). In India alone, 
tigers killed 822 people between 
1990 and 2009 (N. B.S. Chauhan 
Julius Kiihn Arch. http://doi. 
org/46k; 2011). 

It is crucial for the Indian and 
Nepalese governments to install 
protection for villagers against 
tigers and to provide them with 
incentives to participate in 
tiger conservation. Improved 
management of existing reserves 
and more conservation areas are 
also a priority. 

Achyut Aryal* Massey 
University, Auckland, 

New Zealand. 
a.aryal@massey.ac.nz 

*On behalf of 5 correspondents (see 
go.nature.com/cr4jli for full list). 


Healthy diet sustains 
the environment too 


The idea that dietary 
recommendations for humans 
are compatible with an 
environmentally sustainable 
food system is supported by 
a2015 report from the US 
National Academy of Sciences 
(go.nature.com/t2ovdt). 

The implication that dietary 


recommendations should take 
the environmental impact of food 
production into account caused 
controversy earlier this year. Parts 
of the US government and the 
food industry accused the federal 
Dietary Guidelines Advisory 
Committee of going beyond its 
remit in advocating a more plant- 
based diet that would be better for 
sustainability as well as human 
health (see go.nature.com/Icwj6i). 
Scientists, policy-makers and 
other stakeholders must move 
beyond considering health, 
environmental, social and 
economic effects in isolation. In 
recognizing the need to address 
the food system’s complexities 
and its competing demands and 
trade-offs, the National Academy 
of Sciences’ report has already 
been the subject of briefings to 
Congress and US governmental 
agencies. 
Malden Nesheim Cornell 
University, Ithaca, New York, USA. 
Patrick J. Stover Cornell 
University, Ithaca, New York; and 
American Society for Nutrition, 
Bethesda, Maryland, USA. 
Maria Oria The National 
Academies, Washington DC, USA. 
pjs13@cornell.edu 


Beware climate 
neo-scepticism 


We coin ‘neo-scepticisnY 

asa term to describe an emerging 
science-policy argument that 
opposes major efforts to mitigate 
climate change. Neo-sceptics 

do not deny anthropogenic 
global warming, but minimize 
its projected effects and see 
mitigation efforts as unjustifiable 
(see, for example, S. E. Koonin 
Wall Street Journal 19 September 
2014; J. Curry Wall Street Journal 
9 October 2014). 

Neo-sceptic arguments 
emphasize well-known 
uncertainties in climate science, 
particularly over predicted 
temperature changes. This 
diverts attention from the risks 
of events with extreme biological 
and economic consequences and 
from the risks of ‘tipping points’ 


with large, nonlinear responses. 
Neo-sceptics recognize 
the importance of policy 
decisions — who should do 
what, when, and at what cost, 
for example. However, they 
project an ironic certainty in 
advising the postponement of 
serious mitigation efforts until 
confidence intervals for climate 
projections are narrowed. 
Physicists, biologists and social 
scientists should be focusing 
more on risks, rather than simply 
pursuing more-precise estimates 
of physical parameters that are 
not linearly related to the most 
worrisome hazards. Society does 
not demand certainty to cope 
with other high-consequence 
risks, such as those relating to 
public health, airline safety, 
terrorism, and natural or 
technological disasters. 
John H. Perkins* The Evergreen 
State College, Kensington, 
California, USA. 
perkinsj@evergreen.edu 
*On behalf of 4 correspondents (see 
go.nature.com/mg7ugf for full list). 


Chinese scientists 
are sharing data 


We cannot agree with Zheng 
Wan's view that it is getting harder 
for scientists in China to access 
high-quality public data (Nature 
520, 587; 2015). There are still 
technical issues to be resolved, 
but the trend is towards a greater 
amount of open data — not less. 
The Chinese government has 
established several national data 
infrastructures over the past few 
years that are publicly accessible. 
These include the National 
Science and Technology 
Infrastructures organization 
(www.escience.gov.cn), which 
has been sharing data since 2009 
in fields such as Earth systems, 
marine science, meteorology, 
agriculture, forestry, medicine 
and health. One of its 23 projects 
is the National Specimen 
Information Infrastructure 
(www.nsii.org.cn), launched in 
2013, which currently archives 
data from more than 10.5 million 


biological, mineral, rock and 
fossil specimens. 

The degree and success of data 
sharing also rely on individual 
scientists’ attitudes and practices. 
Many researchers — including 
some who are publicly funded 
in China — unfortunately often 
resist sharing data, actively or 
passively. They should instead 
take more responsibility for 
cultivating a sustainable, 
‘bottom-up’ culture of sharing. 
Xiaolei Huang Fujian 
Agriculture and Forestry 
University, Fuzhou, China. 
Keping Ma Institute of Botany, 
Chinese Academy of Sciences, 
Beijing, China. 
huangxl@fafu.edu.cn 


Star academics in 
favour of divestment 


Your portrayal of the campus 
carbon-divestment movement 
could have provided a more 
balanced account of the 
controversy (Nature 521, 16-17; 
2015). The strongest resistance 
to this movement is found not 
among faculty members, but in 
the boardrooms of the managing 
corporations. 

As for the former, our 
own group, Harvard 
Faculty for Divestment 
(harvardfacultydivest.com), has 
grown in less than a year to more 
than 250 names, and includes 
several renowned climate 
scientists and Nobel laureates, as 
well as members of every school 
in the Harvard community. 
James Recht Harvard Medical 
School, Boston, Massachusetts, 
USA. 
james_recht@hms.harvard.edu 


CORRECTION 

In the Outlook article ‘Wild bees: 
Lone rangers’ (Nature 521, 
S62-S63; 2015), entomologist 
Jordi Bosch was incorrectly 
affiliated with the University of 
Barcelona; his affiliation should 
have read the Autonomous 
University of Barcelona. 
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A Mars-sized exoplanet 


Analysis of Kepler data has yielded the smallest known mass for an exoplanet orbiting a normal star. Its mass and size are 
similar to those of Mars, setting a benchmark for the properties of exoplanets smaller than Earth. SEE LETTER P.321 


GREGORY LAUGHLIN 


ith a mass one-tenth that of Earth’, 
half its diameter, and a residual 
gleam of long-lost habitability, 


Mars has an outsized hold on our imagination. 
On page 321 of this issue, Jontof-Hutter et al.’ 
report the discovery of Kepler- 138 b — an alien 
world that has a mass, a radius and perhaps 
an overall composition very similar to those of 
Mars. In contrast to warm Earth or cold Mars, 
however, this planet is baking hot because of 
its proximity to its host star. The current exo- 
planet census is dominated by planets a few 
times the size of Earth and on orbits close to 
their stars, and so our Solar System — with its 
four small inner planets — seems somewhat 
marginalized. The detection of such a small 
exoplanet in a tight orbit could help to clarify 
how we fit into the big picture. 

NASA’s successful Kepler space observa- 
tory made the observations that enabled the 
detection of this low-mass world. Between 
2009 and 2013, Kepler monitored more than 
150,000 stars in a patch of sky just above the 
mid-plane of the Milky Way, and recorded the 
periodic diminutions in brightness that occur 
when planets with appropriately aligned orbits 
transit (pass in front of) their parent stars”. At 
the last count, more than 4,600 candidate plan- 
ets had been identified in the data returned by 
the Kepler mission’. Although the bulk of these 
prospective worlds have radii 2-4 times that 
of Earth’s and display a bewildering range of 
compositions, more than 500 candidates have 
radii smaller than Earth’s. The race is on to ele- 
vate these candidates to confirmed planetary 
detections’. 

Observations of repeating dips in the bright- 
ness of a host star, each dip having identical 
depth and duration, might seem to constitute 
fail-safe evidence of an orbiting planet. There 
is, however, a host of potential false alarms 
that can compromise a detection, notably the 
possibility that a background, eclipsing binary 
star registers on the same detector pixel as the 
candidate planet’s host star, so that periodic 
eclipses in the distant system can mimic plan- 
etary transits of the target star’. So far, more 
than 1,000 of Kepler’s planets have been vet- 
ted and confirmed, but the process of follow- 
up evaluation (especially for the low-mass 
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Kepler 138 


Size of Mercury’s orbit Se 


Figure 1 | The planetary system Kepler-138. 
The graphic depicts the orbital architecture of an 
exoplanet system discovered by the Kepler space 
observatory. Jontof-Hutter et al.' report the sizes 
and masses of planets Kepler-138 b, Kepler-138 c 
and Kepler-138 d. Kepler-138b, ona 10.3-day orbit, 
has the smallest measured mass of any transiting 
exoplanet — it has a size and mass similar to those 
of Mars. Kepler-138 c and Kepler-138 d are slightly 
larger than Earth and have orbital periods of 

13.8 days and 23.1 days, respectively. The orbits are 
drawn relative to the size of Mercury’s 88-day orbit. 


worlds) will continue for decades. 

Transit detections provide an estimate of 
a planet’s size, but they give no information 
about its mass. For massive transiting planets, 
masses can be estimated by Doppler spectro- 
scopic monitoring of the host star’s velocity 
variations along the line of sight of the obser- 
vation. This option, however, is unavailable 
for a planet such as Kepler- 138 b, which is too 
small to induce detectable wobbles in the host 
star’s motion’. 

During the nineteenth century, Mars 
presented a similar challenge to astronomers 
who were trying to estimate its mass. In 1877, 
the discovery of the planet’s tiny satellites 
(Phobos and Deimos) allowed the mass of 
Mars to be determined accurately and directly. 
Earlier estimates, however, beginning with 
the efforts of Jean Baptiste Joseph Delambre 
in 1806, were obtained by carefully account- 
ing for Mars'’s gravitational influence on other 
planets in the Solar System, and calculating 
their effects on the precisely measurable daily 
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location of the Sun. By 1850, refinements of 
this technique had narrowed the Martian mass 
estimate to within 20% of its true value’. 

Jontof- Hutter and colleagues have drawn 
from the same time-tested notebook of 
dynamical astronomy to deduce the mass of 
Kepler-138b. The planet is part of a system 
(Fig. 1) that contains two additional transit- 
ing planets, Kepler-138c and Kepler-138 d, 
which have periods of 13.8 and 23.1 days, 
respectively, and radii slightly larger than that 
of Earth. The inner planet pairing between 
138 b and 138 c has an orbital-period ratio of 
nearly 4:3, whereas the outer pairing between 
138 c and 138d has a ratio near 5:3. Because 
the period ratios are almost exact ratios of inte- 
gers, this enables the gravitational distortions 
of the orbit of one planet by the other planets to 
gradually ebb and flow, producing observable 
departures from expected strict regularity in 
the transit times. Measurements of the planets’ 
transit timing variations constitute a nonlinear, 
computationally intensive inverse problem, the 
solution of which has allowed the authors to 
deduce the masses of all three planets. 

The radii of known exoplanets generally 
display surprisingly little correlation with 
their masses, especially for planets that are 
somewhat more massive than Earth. Jontof- 
Hutter et al. report a mass of around 0.07 Earth 
masses for Kepler-138b, imbuing it with a bulk 
density of about 3 grams per cubic centimetre 
— consistent with a purely rocky composition 
roughly akin to that of Mars. The planet’s mass 
is similar to, but somewhat less than that of 
Mars (by contrast, its surface temperature is 
much higher). Whether we are probing an out- 
lier world, or whether the authors’ discovery 
points to an important trend among low-mass 
planets — whereby smaller radii exhibit strong 
positive correlation with smaller masses — is 
as yet unclear. 

The derivation of dynamical properties 
from transit measurements requires timing 
data with split-second precision, and so the 
mass and density of Kepler-138 b both have 
large uncertainties, of a factor of about two. 
The prime drivers of the uncertainty are 
the relatively low number of planetary tran- 
sits that have been observed and the timing 
precision. These can be improved both with 
longer observation campaigns and, eventually, 


with space missions capable of observing at 
intervals shorter than Kepler’s 1-minute limit. 
The European Space Agency’s PLATO (Plane- 
tary Transits and Oscillations of Stars) mission 
planned for the mid-2020s, for example, will be 
able to monitor the brightest transit-bearing 
stars at a temporal resolution of 2.5 seconds. 
Kepler-138 b orbits a red dwarf star, the mass, 
radius, chemical composition and space motion 
of which are close to those of the ‘average’ star 
in the Milky Way. The Kepler-138 planetary 
system also seems to be unexceptional when 
placed in its galactic context. The Kepler mis- 
sion has demonstrated that it is extremely com- 
mon for stars to harbour multiple planets with 
orbital periods of less than 100 days and masses 
of lower than 20 Earth masses. By contrast, the 
reaches of the Solar System interior to Mer- 
cury, which has an 88-day orbit, are completely 
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empty. It is imperative to improve our under- 
standing of how our system's architecture and 
evolution fit into the overall census — the 
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Satellite) mission’, scheduled for 2017, and 
will help to provide an answer. = 


authors’ study is a step towards that goal. 

A pressing question is whether planets such 
as those orbiting Kepler-138 formed in situ, 
or whether they accumulated in colder, dis- 
tant regions of the protoplanetary disk before 
migrating inwards. Short-period, low-mass 
planets that cannot retain hydrogen atmos- 
pheres, and the densities of which are similar 
to mixtures of rock and ice, present evidence in 
favour of inward migration from cold regions. 
This is because planets with an icy bulk can- 
not have formed close to their star. Additional, 
increasingly accurate mass and radius meas- 
urements, both for Kepler-138b and for other 
small exoplanets, will be facilitated soon by 
NASAss TESS (Transiting Exoplanet Survey 


Receptors for 


selective recycling 


Two studies show that the engulfment of certain intracellular membranous 
structures by vesicles called autophagosomes regulates the structures’ degradation 
in a selective, receptor-protein- mediated manner. SEE LETTERS P.354 & P.359 


DAVID C. RUBINSZTEIN 


utophagy — the internal digestion of 
A= ofa cell — is an evolutionarily con- 

served pathway for recycling damaged 
or unwanted cellular components. Although the 
process was initially thought to involve degrada- 
tion en masse of cytoplasmic contents, evidence 
has emerged’ to suggest that individual compo- 
nents can be selectively isolated in preparation 
for autophagy. Two complementary papers in 


FAM134B 


LIR 
domain 


( / 


this issue*’ identify specific receptor proteins 
in yeast and mammalian cells that enable the 
selective degradation of an organelle called the 
endoplasmic reticulum (ER). 

During autophagy, cup-shaped double- 
membrane structures called phagophores 
develop into spherical structures called 
autophagosomes, which engulf cytoplasmic 
components destined for autophagic degrada- 
tion. Autophagosomes fuse with vesicles called 
lysosomes, which degrade and then recycle 


Lysosome | ~ 


ER fragment 


Phagophore 


Autophagosome 


Gregory Laughlin is in the Department 
of Astronomy and Astrophysics, University 
of California at Santa Cruz, Santa Cruz, 
California 95064, USA. 

e-mail: laughlin@ucolick.org 
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the engulfed components (Fig. 1). Selective 
autophagy of certain bacteria, aggregation- 
prone proteins and organelles such as mito- 
chondria seems to be mediated by autophagy 
receptor proteins. These bind to protein com- 
ponents of the phagophore and autophago- 
some, such as Atg8 in yeast and its equivalent 
in mammals, LC3, through interactions with 
LC3-interacting domains (called Atg8- 
binding pockets and LIR domains in yeast and 
mammals, respectively)’. 

The ER, which directs proteins towards the 
plasma membrane for secretion or residency, 
can also be degraded through autophagy (a 
process called ER-phagy)*. It has not previ- 
ously been possible to test whether ER-phagy 
is required for health, although in animal mod- 
els it seems to protect against a,-antitrypsin 
deficiency, a condition in which a toxic form 
of the a,-antitrypsin protein accumulates in 
the ER and causes liver damage’. But is ER- 
phagy merely part of bulk autophagy, or does 
some cellular machinery enable the prefer- 
ential selection of ER into autophagosomes, 
as has been previously hypothesized®? 


Autolysosome 


Figure 1 | Selective cellular recycling. During an intracellular recycling 
process called autophagy, structures called phagophores form structures 
called autophagosomes, which engulf damaged or unwanted intracellular 
components before fusing with lysosomes to form autolysosomes that degrade 
their contents. Khaminets et al.° report that, in mammals, the receptor 


protein FAM134B mediates autophagy of an organelle called the endoplasmic 
reticulum (ER). ER-bound FAM134B interacts with a protein called LC3 in 
phagophores through an LIR domain, leading to engulfment of ER fragments 
by autophagosomes. Mochida et al.’ report that Atg39 and Atg40 have 
equivalent roles in yeast (not shown). 
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The current studies provide robust support 
for this hypothesis. The key starting point for 
both groups was the identification of autophagy 
receptors that interact with the ER and with 
autophagosomes, which they achieved by 
searching for proteins that physically interact 
with LC3 or Atg8. Khaminets et al.’ (page 354) 
report that FAM134B, which has an LIR 
domain and a reticulon domain that binds 
to and fragments the ER, interacts with LC3 
in mammalian cells. In yeast, Mochida et al. 
(page 359) demonstrate that Atg39 and Atg40 
(which seems to bea counterpart of FAM134B), 
interact with Atg8 and the ER. The groups 
report that loss of Atg39, Atg40 or FAM134B 
impedes autophagy-dependent degradation 
of the ER, but does not affect bulk autophagy. 
Furthermore, mutant forms of the proteins that 
do not bind to the phagophore and autophago- 
some are ineffective in ER-phagy. 

Mochida et al. showed that Atg40 is 
associated with the cytoplasmic ER and the ER 
close to the plasma membrane and seems to 
be involved in their degradation. By contrast, 
Atg39 is associated with the ER that surrounds 
the nucleus. Atg39 regulates degradation of 
both this nuclear envelope and some nuclear 
constituents — a previously unidentified 
form of selective autophagy that the authors 
call nucleophagy. It will be interesting to 
determine whether nucleophagy occurs in 
mammalian cells, and to define when the pro- 
cess occurs, which components are degraded, 
and why it is required. For instance, nucleo- 
phagy might contribute to the breakdown of 
the nuclear envelope that occurs when cells 
divide. Nucleophagy could also theoretically 
be responsible for quality control and repair 
of damaged nuclear membranes. This could 
be tested by investigating whether impaired 
nucleophagy results in the accumulation of 
malfunctioning pores in the nuclear envelope 
and compromised segregation of nuclear and 
cytoplasmic contents. 

Khaminets and colleagues provide data 
suggesting that FAM134B helps to fragment 
the ER into ‘bite-sized’ pieces that can be 
readily incorporated into autophagosomes. 
This model is analogous with the hypothesis’ 
that the division of mitochondria enables their 
own sequestration and degradation through 
autophagy. The role of FAM134B is of particu- 
lar interest, because it is mutated in a human 
disorder of the sensory nerves in which pain 
perception is impaired, resulting in ulcera- 
tion of the hands and feet®. Khaminets et al. 
report that mice lacking FAM134B have simi- 
lar sensory defects to those seen in the human 
disorder. In these mice, ER accumulates in the 
cell bodies of pain-sensing neurons before 
any morphological abnormalities are appar- 
ent, raising the interesting possibility that 
these neurons may be particularly vulnerable 
to defective ER-phagy. 

The suggestion that ER-phagy might be 
required for cellular health is supported by 
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Mochida and colleagues’ observation that 
Atg39-deficient yeast are more sensitive than 
wild-type cells to the stress of nitrogen starva- 
tion. However, the possibility cannot be ruled 
out that sensory abnormalities are mediated 
by autophagy-independent roles of FAM134B. 
If the sensory abnormalities observed in mice 
are directly due to the increase in ER volume, 
future studies should be able to determine 
whether the defects are caused by the toxic 
effects of the uncleared organelles, or whether 
the increased ER volume affects cellular signal- 
ling and neurotransmission. 

The two complementary studies” have 
identified seemingly analogous systems for 
regulating ER degradation through autophagy, 
which will lead to a deeper understanding 
of the process. Goals for the future include 
understanding the relevance of the ER-phagy 
machinery to maintenance of the nucleus in 
mammalian cells, investigating the conse- 
quences of impaired or excessive ER-phagy, and 
defining how the process is regulated. The activ- 
ities of many autophagy receptors can be modu- 
lated by post-translational modifications such 
as phosphorylation’, and it will be interesting 
to find out whether such mechanisms are per- 
tinent to ER-phagy. The idea that the activity of 
receptors involved in ER-phagy can be regulated 
is supported by Mochida and colleagues’ obser- 
vation that levels of Atg39 and Atg40 increase 
when cells are treated with the drug rapamycin, 
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which induces some of the signalling changes 
associated with nitrogen starvation. 

Finally, it remains to be seen whether there 
is any specificity to which parts of the ER 
and nuclear envelope are cleared by selective 
autophagy. If there is, what is selected and 
why? It will be important to discover whether 
ER-phagy is indiscriminate, or whether the 
receptors target damaged ER in the way that 
damaged mitochondria are preferentially 
degraded by mitophagy and intracellular 
pathogens by xenophagy’. m 
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Round the bend 
with microwaves 


Simulations reveal that microwaves propagating through a waveguide can travel 
around sharp bends in the device without being reflected. The finding might 
open the way to technologies that exploit this uncommon phenomenon. 


SUNIL MITTAL & MOHAMMAD HAFEZI 


systems do not always perform accord- 

ing to expectations. Fabrication errors 
can give rise to defects that affect the propaga- 
tion of electromagnetic waves — from radio 
to optical wavelengths — through the wave- 
carrying medium, causing undesired effects. 
For example, structural defects introduce noise 
in the mediums properties that can, in extreme 
cases, completely block photon transmission. 
Efforts are therefore under way to engineer 
systems whose topology (rather than shape) 
governs photon transport, making them 
immune to such effects. These efforts rely 
on results'” showing that, as photonic sys- 
tems undergo a certain type of continuous 
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deformation, their properties remain intact 
and photon transport becomes topologi- 
cally protected. Writing in Physical Review 
Letters, Ma et al.* demonstrate this theo- 
retically for a waveguide design. Their 
system could enable the topologically pro- 
tected transport of microwave photons, 
which would be able to travel around sharp 
corners without being reflected backwards. 

In the early 1980s, physicists realized that 
certain physical properties of electronic sys- 
tems are entirely dictated by the topology of 
the material concerned’. A hallmark discovery 
was that, in the presence ofa strong magnetic 
field, the transverse electrical conductance 
of a material becomes quantized (the quan- 
tum Hall effect). In materials that exhibit 
this phenomenon, conduction is remarkably 
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Figure 1 | Reflection-free microwave propagation in a waveguide. Ma et al.’ have simulated the 
propagation of microwaves in a metawaveguide. The proposed device contains a hexagonal array of metal 
cylinders displaying an asymmetry along the cylinder axis — a gap between one end of the cylinders and 
the enclosing metal plate. The two sections shown are joined together such that the position of the gap 

is inverted between them (on the left the gap is at the bottom, whereas on the right it is at the top). This 
configuration allows microwaves to travel along the bent interface between the two sections with negligible 


backward reflection at the two 120° turns. 


robust against defects and disorder. 

The electronic states in the interior of such 
systems become localized so that the bulk of 
the material becomes an insulator. By con- 
trast, the boundary of the material supports 
extended one-way circulating electronic states, 
known as chiral edge states, that mediate con- 
duction. The system’s topology determines 
whether edge states occur; materials that 
allow them are called topological insulators. 
A remarkable property of edge states is that 
they allow electrical current to route around 
disorders, with no backscattering of electrons. 

It was initially thought that topological 
insulators could be generated only in the pres- 
ence of a magnetic field, which specifically 
breaks the time-reversal symmetry of the 
system. However, in 2005, Charles Kane and 
Eugene Mele showed’ that similar topologi- 
cally robust properties can be achieved through 
spin-orbit coupling of electrons — that is, 
coupling of the electrons’ orbital angular 
momentum with their spin angular momen- 
tum — while preserving the system’s time- 
reversal symmetry. This opened up a wider 
range of systems that can have topologically 
protected states. Since then, topological insula- 
tors have attracted great interest, both for their 
intriguing fundamental properties and for their 
potential application in quantum computation. 

Photonic systems can exhibit topological 
features analogous to those of electronic sys- 
tems®*. Ma et al. simulate a system that acts 
like a topological insulator for microwave 
radiation. The authors consider a device 
called a parallel-plate metawaveguide, which 
consists ofan array of centimetre-scale metal- 
lic cylinders arranged in a hexagonal lattice, 
bounded on either side by metal plates (Fig. 1). 
For microwaves that have wavelengths of about 


15cm, this array constitutes a ‘metamaterial’ 
in which the refractive index varies ona scale 
comparable to the wavelengths of microwaves. 

The metawaveguide supports two decoupled 
polarization modes of the microwaves electro- 
magnetic fields — the transverse electric (TE) 
and transverse magnetic (TM) modes. These 
can propagate only in a plane perpendicular 
to the cylinder axis. The hexagonal geometry 
of the cylinder lattice causes the emergence of 
Dirac cones in the metawaveguide’s energy- 
band structure — cone-shaped bands that 
meet at their tips. At the tips of the Dirac cones, 
the TE and TM modes have the same energy, 
and the velocity with which the wave envelopes 
propagate is constant near this point. 

Maand colleagues show that introducing an 
asymmetry in the metawaveguide'’s geometry, 
in the form ofa gap between the cylinder array 
and one of the enclosing plates, couples the 
TEand TM modes. This change in the array’s 
configuration modifies the energy-band struc- 
ture of the device, causing a bandgap to open 
up between the Dirac cones. The resulting 
bandgap has non-trivial topological proper- 
ties analogous to those described by the elec- 
tronic Kane—Mele model. In other words, the 
metawaveguide becomes a photonic analogue 
ofa topological insulator. Here, symmetric and 
antisymmetric combinations of the TE and TM 
modes, which are time-reversed counterparts, 
constitute a pseudo-spin, and the asymmetry 
induces spin-orbit interaction. 

In further simulations, Ma et al. show that if 
two such metawaveguides are coupled to each 
other in a configuration of opposing topologi- 
cal indices — for example, such that the posi- 
tion of the gap at the tips of the metal cylinders 
is inverted between the two devices — the 
interface between them displays intriguing 
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properties. Specifically, the authors observe 
topologically protected edge states of oppos- 
ing chirality (handedness), which propagate in 
opposite directions along the interface. They 
show how these states can be excited by placing 
a suitably polarized electric or magnetic dipole 
inside the gap of the metawaveguide that is 
adjacent to the interface. For example, a left- 
polarized dipole excites microwaves that always 
propagate in the backward direction, whereas 
a right-polarized dipole excites forward-only 
waves. These waves are immune to backscatter- 
ing and can take sharp turns along the interface. 
The authors’ simulations demonstrate this for 
an interface bent by 120°. 

However, these edge states are not robust 
against all types of disorder. For example, a 
disorder that flips the pseudo-spin of a state 
can couple a forward-propagating state to a 
backward-propagating one, destroying the 
edge states’ protection against backscatter- 
ing. This is analogous to the effect of magnetic 
impurities in materials exhibiting the ‘quan- 
tum spin Hall effect’ against which edge states 
are not protected. In the wavelength domain 
applicable to telecommunications (about 
1,550nm), the effects of such pseudo-spin flips 
are negligible’, but it is not known whether the 
effect is also small in the microwave domain. 
Future work may clarify this. 

An experimental realization of the effects 
modelled in Ma and colleagues’ work would 
open up new ways to engineer electromagnetic 
structures that would benefit from topological 
protection’®"’. For example, one could explore 
designs for microwave devices such as wave- 
guides, filters, antennas and amplifiers that 
would make the devices insensitive to design 
defects and that could have applications in 
telecommunications technology. m 
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The power of positivity 


Stimulating neurons in the brain’s hippocampus that are normally 
activated by pleasurable experiences protects mice from the depressive 


consequences of stress. SEE LETTER P.335 


ALEX DRANOVSKY & E. DAVID LEONARDO 


ur ability to construct a personal 

narrative and to establish a sense of 

identity depends on our recollection 
of past events, the most significant of which are 
inevitably associated with emotions. Under- 
standing how emotion-laden memories affect 
behaviour forms the bedrock of psychotherapy 
treatments, but the biology of this process is 
poorly understood. On page 335 of this issue, 
Ramirez et al.' shed light on the subject, dem- 
onstrating that depressive-like stress responses 
in mice can be acutely suppressed through 
artificial reactivation of a small population of 
neurons that had previously been activated by 
a positive experience. 

Understanding how memories are stored 
in the brain has been the subject of intensive 
research”, much of which has focused on the 
hippocampus — an elongated structure in 
the brain’s temporal lobe that is essential for 
forming long-term memories and regulating 
responses to stress. The hippocampus contrib- 
utes to the generation of cellular representa- 
tions of experiences through the activation of 
ensembles of neurons. In mice, ensembles in 
a hippocampal subregion called the dentate 
gyrus can be genetically tagged with light- 
sensitive molecules while being activated by 
an experience’. Subsequent experimental 
reactivation of the ensembles using pulses of 
light causes the animals to behave in a way that 
suggests they are recalling the original experi- 
ence’. Asa result, these neuronal ensembles are 


a Positive b Stress 


experience 


thought to constitute the physical substrate of 
memory, called an engram. 

Ramirez et al. used a previously described 
system’ to selectively express light-sensitive 
molecules in a subset of mouse dentate-gyrus 
cells while the cells were being activated by 
experiences, tagging them for optical reacti- 
vation at a later time. The authors tagged these 
cells in male mice under three conditions: a 
rewarding experience, in which the mice were 
placed in a cage with a female; a neutral expe- 
rience, in which they were placed in an empty 
cage; or a negative experience, in which they 
were immobilized, leading to stress. The ani- 
mals were subsequently exposed to stress for 
10 consecutive days, which induced increased 
anxiety and depression-related behaviours 
such as passivity and decreased preference for 
enjoyable activities (anhedonia). 

The investigators found that the depression- 
like behaviours seen in animals that had been 
subjected to this stressing could be reversed by 
optically reactivating dentate-gyrus neurons 
tagged during the positive experience, whereas 
activation of neutral- or negative-tagged neu- 
rons did not have this effect (Fig. 1). 

Repeated exposure to positive imagery has 
been used in attempts to battle the symptoms 
of human depression’. Could repeated activa- 
tion of a positive engram provide enduring 
protection from the depression-like conse- 
quences of stress in mice? Ramirez and col- 
leagues found that when they reactivated 
the positive engram every day for five days 
in previously stressed animals, passivity and 
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Figure 1 | Light-induced stress relief. a, Ramirez et al.' genetically tagged neurons in the dentate- 

gyrus region of the brains of male mice while the neurons were being activated (indicated in blue) by the 
positive experience of being in a cage with a female, or by a negative or a neutral experience (not shown). 
b, The mice were subsequently subjected to repeated immobilization, causing stress that led to to anxiety 
and depression-related behaviours. c, The depression-like behaviours associated with stress could be 
reversed by reactivating the positive-tagged neurons with pulses of blue light, but reactivating the neutral- 


or negative-tagged neurons did not have this affect. 
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anhedonia were reversed on day six. Impor- 
tantly, animals exposed to the positive experi- 
ence itself for five days after the repeated stress 
did not show this enduring protection. More- 
over, the decreased production of new neurons 
that is routinely observed following stress was 
reversed by repeated engram stimulation, but 
not by the repeated positive experience. 

Ramirez and colleagues’ provocative results 
raise several questions relevant to stress- 
related disorders in humans. Both anxiety 
and depression are associated with repeated 
exposure to stress, and the two disorders often 
occur together. However, acutely activating 
the dentate-gyrus engram in mice primarily 
reversed depression-associated behaviours. 
Might an engram that can affect anxiety be 
found elsewhere in the hippocampus or other 
regions of the brain? 

Because a single reactivation of the positive 
engram did not relieve anxiety, the authors did 
not investigate the effects of repeated engram 
stimulation on this trait. But although fast- 
acting treatments for depression and anxiety 
differ, chronic treatments for the two condi- 
tions overlap. Could repeated activation of 
the positive engram, unlike the single inter- 
vention, prevent anxiety-associated manifes- 
tations of stress? Finally, it will be interesting 
to investigate the other side of the coin — 
whether reactivation of traumatic memories is 
sufficient to cause stress-related disease. A first 
step to exploring this avenue in mice would 
be to test whether reactivating an ensemble 
of dentate-gyrus neurons tagged by negative 
experiences could cause anxiety or depression- 
like behaviours. 

Why is reactivating an engram ofa positive 
experience a more effective stress-reliever than 
the experience itself? We are left wondering 
whether the repeated recall of positive memo- 
ries will result in resilience to adversity. It is 
intriguing to speculate that nostalgia serves a 
similar stress-reducing purpose in humans*”. 
Perhaps an experience itself results simply 
in memory storage, whereas its recollection 
activates a neuronal network associated with 
reward, thereby changing behaviour. 

If, as has been suggested’, stress results ina 
cognitive bias against rewarding associations, 
then reactivation of a ‘positive’ engram laid 
down in an animal or human that was previ- 
ously stressed would not combat the con- 
sequences of later stress — a hypothesis that 
could easily be tested in the authors’ animals. In 
support of this suggestion, Ramirez et al. found 
that inhibiting a component of the reward 
neuronal circuitry while activating a positive 
engram interfered with the antidepressant 
properties of reactivating that engram. Consist- 
ent with this, a study’* of people with depression 
suggests that, unlike healthy individuals, they 
cannot improve their mood by recalling posi- 
tive memories. So the ability to remember your 
way to happiness may be a normal regulatory 
process that is disrupted in depression. m 
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Growing feedback from 
ocean carbon to climate 


The finding that feedbacks between the ocean’s carbon cycle and climate may 
become larger than terrestrial carbon- climate feedbacks has implications for the 
socio-economic effects of today’s fossil-fuel emissions. 


FORTUNAT JOOS 


lobal warming reduces the uptake 

of carbon dioxide by the ocean and 

the land biosphere. This allows more 
human-caused CO, to remain in the atmos- 
phere, where it causes more global warming 
— an amplifying climate-carbon feedback. 
The amplifying feedbacks associated with car- 
bon in vegetation and soils are conventionally 
thought to be several times larger than those 
associated with carbon in the ocean. Writing 
in Global Biogeochemical Cycles, Randerson 
etal.' report that this idea needs to be revisited 
and refined: in the long term, the ocean will 
have a larger role than land in climate-carbon 
coupling. 

The authors combined results from 
long-term simulations using an Earth- 
system model — specifically the 
Community Earth System Model 
(CESM)* — with an established 
analysis’ of feedbacks in the carbon- 
climate system. They ran the model up to 
the year 2300 using contrasting set-ups in 
which the greenhouse-gas forcing of CO, 
was turned on or off. They report that the 
ocean carbon-climate feedback (meas- 
ured as the difference in the amount of 
carbon cumulatively taken up by the 
ocean for the set-up that accounts for 
forcing from all the main anthropogenic 
emissions, including CO,, and for the 
set-up that omits such forcing) is 3% for 
1999 and increases to a staggering 23% 
by 2300. Surprisingly, the ocean-climate 
feedback becomes larger than the land- 
climate feedback beyond 2100 (Fig. 1). 

This apparent conflict with the con- 
ventional view of a smaller ocean than 
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carbon from the sea surface to the deep 
ocean. Land carbon-climate feedbacks are 
therefore larger in the near term than marine 
ones, but this situation will reverse in the 
long run. 

Ocean carbon uptake is primarily a physico- 
chemical process in which the rate-limiting 
step is the transport of water from the sur- 
face to the deep, and in which the reaction 
of CO, with water releases hydrogen ions, 
causing ocean acidification’. Randerson and 
co-workers report that warming decreases 
the solubility of CO, in water, reduces the 
density of surface waters and slows the over- 
turning circulation (the movement of surface 
water from high latitudes to the deep ocean, 
compensated for by rising waters from other 
latitudes). These effects reduce the over- 
all uptake capacity of the ocean for, and the 
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Figure 1 | Feedback between climate and the carbon 
cycle. Randerson et al.' have used a computational model to 
show that the ocean’s contribution to feedback between climate 
and the carbon cycle increases considerably over time, and 
exceeds contributions from land by the year 2100. The bar 
chart shows, for different periods, the reduction in the uptake 

of anthropogenic carbon (measured in petagrams of carbon; 

1 petagram is 10" grams) by the ocean and the land biosphere 
that is attributable to climate change, per increase in global mean 
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surface-to-deep transport of, anthropogenic 
carbon. The magnitude of the ocean-climate 
feedback reported by the researchers, and 
the main proposed feedback mechanisms — 
including a near-collapse of the Atlantic over- 
turning circulation and a small role for marine 
biological processes — are consistent with 
the findings of earlier studies (see ref. 6, for 
example) that used models with simpler ocean 
dynamics than those in the CESM. 

On land, the authors find that a loss of 
carbon results from increased rates of soil- 
carbon decomposition due to warming. This 
dominates the stimulation of plant productiv- 
ity caused by increased moisture and growing- 
season length at high latitudes. In addition, 
human land use reduces the amount of carbon 
in vegetation in their scenario, further limit- 
ing the potential amplifying feedback between 
land carbon and climate. 

The representation of biogeochemical 
processes in the CESM is advanced, but many 
feedbacks to those processes are uncertain or 
not represented. These include’ the impacts of 
ocean acidification, warming, increased levels 
of dissolved CO, anda potential increase in the 
volume of low-oxygen zones on marine eco- 
systems, biological productivity, the produc- 
tion of the greenhouse gas nitrous oxide, the 
export of biogenic particles from the surface 
to the deep ocean, and their sinking velocity 
and decomposition rate. On land, thawing of 
permafrost carbon may lead to the release of 
CO, (ref. 8) and methane’, processes 
not accounted for in the version of the 
CESM used by the authors. More work is 
needed to address changes in the extent 
of wetlands and peatlands in model 
projections, and to better quantify emis- 
sions of methane, nitrous oxide and 
organic compounds from the land bio- 
sphere, because these changes and emis- 
sions all contribute to climate forcing”. 

Data for past variations in CO, levels 
and climate can help us to avoid rely- 
ing exclusively on models to project 
future trends. Although palaeoclimatic 
information cannot be directly used to 
assess how climate change affects the 
flow of anthropogenic carbon, it does 
reveal how the natural carbon cycle 
alters. Palaeoclimatic data document 
that the ratio of the change in atmos- 
pheric CO, concentration to the change 


land feedback’ results from differ- 
ences in timescales. Most carbon cycles 
through plants and soils within decades, 
whereas it takes centuries to transport 


surface air temperature of 1°C. The authors computed these 
sensitivities from differences in cumulative carbon flux and 
global mean surface air temperature between a simulation in 
which CO, influences climate and another in which the direct 
influence of CO, on global warming is suppressed. 


in global mean surface air temperature 
is increasing with the timescale of cli- 
matic variations, and that the increase 
of CO, on decade-to-century timescales 
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is probably less than 20 parts per million for 
each degree of warming”. 

Further constraints on carbon-climate 
interactions arise from isotope data that allow 
terrestrial and oceanic greenhouse-gas sources 
to be disentangled. Carbon isotope data show 
that decadal CO, variations during the past 
millennium were predominantly driven by 
land-carbon variations", consistent with 
Randerson and colleagues’ finding of a small 
near-term marine feedback. But isotope data 
for nitrous oxide question the assumption 
made in the CESM that nitrogen availability 
strongly limits plant growth. These data show 
that the land source of nitrous oxide increased 
during the last deglaciation, which implies 
that nitrogen flow through the land biosphere 
increased and that nitrogen limitation was 
small’*, A long-standing research challenge 
is to develop Earth system models that per- 
form seamless simulations from the past to the 
future, consistently integrating palaeoclimatic 
and modern instrumental information in 
projections. 

Randerson and colleagues’ work highlights 


close the currently available window to meet 
stringent climate targets”. m 
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Aggregates feel 


the strain 


another research front. The marine carbon 
climate feedback increases in proportion to 
ocean-heat content in their model. Researchers 
would benefit from observational and mod- 
elling studies in which spatial and temporal 
variations of physical and biogeochemical 
parameters, such as temperature, carbon and 
nutrients, are analysed jointly to draw a more 
complete and self-consistent picture of the 
changing ocean. Biogeochemical variables are 
particularly sensitive indicators of change, but 
ocean physics governs both carbon and heat 
uptake — and therefore sea-level rise, climate 
change and their impacts. 

The current study acts as a timely reminder 
of several policy-relevant findings, ahead 
of the conference to be held by the United 
Nations Framework Convention on Climate 
Change in Paris this December. First, the 
larger the amplifying carbon-climate feedback 
is, the more stringent will emission reduc- 
tions have to be to stabilize climate at any 
given level. Second, shifting the peak in car- 
bon emissions to after 2100 imposes on future 
generations not only the burden of implement- 
ing a carbon-free economy, but also global 
warming of about 10°C above preindustrial 
temperatures’. 

Third, carbon emitted today will change 
our environment irreversibly for many gen- 
erations to come, and these changes increase 
hand in hand with cumulative carbon emis- 
sions’*. Sea level and the ocean's acidity and 
carbon and heat content — and the associated 
adverse effects — will continue to increase long 
after atmospheric CO, levels have stabilized, 
underscoring the need for near-term emission 
reduction. Finally, carbon-emission reductions 
are urgently needed if we are to limit global 
warming and ocean acidification to moderate 
levels. Any delay will narrow and eventually 
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Aggregates of «-synuclein protein can form in various cell types and cause 
different neurodegenerative disorders. The existence of strains with distinct 
structural conformations might explain this variability. SEE LETTER P.340 


SEUNG-JAE LEE & ELIEZER MASLIAH 


arkinson’s disease, dementia with Lewy 
bodies and multiple system atrophy are 


devastating, progressive, age-related 
neurodegenerative disorders that are char- 
acterized by the accumulation of clumps of 
a-synuclein protein in brain cells’. These dis- 
eases, dubbed synucleinopathies, can be dif- 
ferentiated from one another by the distinct 
anatomical patterns and cell types in which 
a-synuclein is deposited’. On page 340 of this 
issue, Peelaerts et al.* propose that different 
‘strains’ of a-synuclein, which adopt different 
structural conformations, might explain this 
variability. 

According to current models, deposits of 
a-synuclein are first laid down in the synaptic 
connections between neurons during the early 
stages of synucleinopathy*. Deposition later 
spreads to other sites: to neuronal cell bodies, 
where they form aggregates called Lewy bodies; 
to neuronal projections called axons and den- 
drites, forming aggregates called Lewy neurites; 
and to cells known as oligodendrocytes that 
support and insulate axons, forming aggregates 
called glial cytoplasmic inclusions (GCIs)’. 

These GCIs are the predominant deposits 
in people with multiple system atrophy’. Lewy 
bodies and Lewy neurites are characteristic of 
both Parkinson's disease and dementia with 
Lewy bodies — although the patterns in which 
the proteins spread vary between these two 
diseases. Furthermore, patterns of a-synuclein 
deposition, and consequently clinical 
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symptoms, can vary between individual cases 
of the same disease’. But the reasons underlying 
this variability have remained unclear. 
Synucleinopathies have several features 
in common with prion diseases, which are 
neurodegenerative diseases caused by the 
accumulation of misfolded prion protein in the 
brain. For example, the symptoms, biological 
abnormalities and incubation times of both 
types of disease are variable. In prion diseases, 
variability between individuals depends on the 
conformation adopted by aggregates of prion 
protein. Aggregates of a specific conformation 
are thought to propagate throughout the brain 
by inducing normal prion proteins to clump 
together into misfolded aggregates of the same 
conformation — a concept known as ‘seeded 
aggregation. Evidence suggests’ that aggregates 
of a-synuclein and other neurodegeneration- 
associated proteins spread through brain cells 
in a similar way, which might explain the 
varied patterns in which deposition spreads. 
In support of this hypothesis, two distinct 
aggregates of a-synuclein, fibrils and ribbons, 
have been characterized’. These aggregates 
faithfully propagate their structural features 
through seeding, and induce different abnor- 
malities in cultured cells. Following up on 
these findings, Peelaerts et al. investigated 
whether ribbons and fibrils are genuine pro- 
tein strains by testing whether they cause 
different symptoms and biological changes in 
animals. The authors injected rat brains with 
four different types of a-synuclein: ribbons; 
fibrils; homogenized brain samples from mice 
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Figure 1 | Structural variability and disease. The protein a-synuclein can adopt different structural 
conformations. Monomers of the protein interact with each other to form complexes called oligomers, 
which in turn are converted into either fibrils or ribbons. Furthermore, it is possible that fibrils are 
converted into ribbons. Peelaerts et al.’ report that the different conformations have different properties: 
fibrils primarily cause neurodegeneration and behavioural defects when injected into the brains of rats, 
whereas ribbons mostly cause the accumulation of protein aggregates called Lewy bodies, Lewy neurites 
and glial cytoplasmic inclusions (GCIs). Structures are not to scale. 


that produce a human a-synuclein aggregate; 
and short strings of abnormal a-synuclein 
protein called oligomers, which might give 
rise to the more complex ribbon and fibril 
structures®. They then analysed a-synuclein 
deposition in these animals. 

Ribbons caused more deposition than other 
types of a-synuclein. By contrast, and consist- 
ent witha study in mice’, fibrils more potently 
promoted other hallmarks of neurodegenera- 
tive disease, such as degeneration of dopamine 
axons in the striatum region of the brain and 
deficits in motor function. Fibrils were more 
stable than ribbons, which generated struc- 
tures resembling Lewy bodies and Lewy 
neurites. These findings indicate that the 
aggregates induced by ribbons and fibrils are 
indeed distinct. When they injected ribbons 
into rats that had been engineered to produce 
too much a-synuclein, Peelaerts et al. observed 
the generation of GCIs, but inoculation with 
fibrils did not have this effect. Together, these 
results suggest that different synucleinopathies 
are defined by different strains of a-synuclein 
aggregate (Fig. 1). 

The researchers also demonstrated that 
intravenous inoculations of aggregates that 
had been preformed in vitro led to the deposi- 
tion of a-synuclein in the central nervous sys- 
tem of animals. Blood contains relatively large 
amounts of a-synuclein”, so this result sug- 
gests that, if changes in this peripheral system 
affect conformations of a-synuclein, this could 
translate into specific neuronal abnormalities. 

Peelaerts and colleagues’ study has several 
implications. First, neurodegeneration and 
behavioural deficits seem to be associated pri- 
marily with fibrils, whereas the formation of 
Lewy bodies, Lewy neurites and GCIs is asso- 
ciated more with ribbons. Second, the distinct 
patterns of a-synuclein deposition observed 
after cerebral inoculations of fibrils and 
ribbons suggest that different strains of 
aggregate are the explanation for the vari- 
ability of synucleinopathies. Third, strains 
might not have to be generated in the 
central nervous system, or even in neurons, 
but instead could be transported there after 


being generated in the periphery. 

Despite Peelaerts and colleagues’ findings, 
and advances made by other groups”, it 
remains to be definitively proved that injected 
aggregates can amplify through seeding. An 
alternative explanation is that the injected 
aggregates lead to changes such as inflam- 
mation in the microenvironment of the brain 
region into which they are injected, and that 
this could eventually result in distinct abnor- 
malities. Synucleinopathies are accompa- 
nied by brain inflammation in humans, and 
the authors showed that injecting different 
a-synuclein aggregates caused brain inflam- 
mation to different extents. In vitro amplifi- 
cation and structural characterization of the 
amplified aggregates from brain tissues could 
provide the much-needed proof of aggre- 
gate seeding’. Whether ribbons or fibrils 
are sustained if brain tissue from the infected 
animal is injected into another animal also 
needs further validation to confirm that these 
aggregates are indeed strains. 

Can strains generated in the blood induce 
a-synuclein aggregation in the brain and cause 
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neurodegeneration? Peelaerts and colleagues’ 
results indicate that this is a possibility. The 
study also suggests that the blood of people 
with Parkinson's disease, dementia with Lewy 
bodies or multiple system atrophy might con- 
tain specific strains of a-synuclein that can 
serve as biomarkers for these diseases. 

The results raise the question of whether 
a-synuclein pathology can be transmitted 
between humans through blood. So far, there 
is no evidence for such a phenomenon, but 
future studies should address this question. = 
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Peculiar boron 
startles again 


Boron’s unusual properties inspired big advances in chemistry. A compound in 
which boron binds two carbon monoxide molecules reveals another oddity — 
the element forms bonds similar to those of transition metals. SEE LETTER P.327 


GERNOT FRENKING 


ransition-metal compounds take part in 
a wide range of reactions and catalytic 
cycles as a result of the metals’ abilities 
to form complexes with several ligands 
through a process known as donor-acceptor 
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bonding. Many transition metals can bind 
several carbonyl ligands (carbon monox- 
ide molecules). By contrast, the main-group 
elements — those found in groups 1, 2 and 13 
to 18 of the periodic table, excluding hydrogen 
— were fora long time thought not to be able 
to form complexes that bind more than one 
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carbonyl. Writing on page 327, Braunschweig 
et al.' report the synthesis of a compound, 
known as a borylene adduct, in which an 
atom of the main-group element boron carries 
two carbonyl ligands, and which has remark- 
able similarities to transition-metal carbonyl 
complexes. This provides striking evidence of 
donor-acceptor bonding in the chemistry of 
main-group elements. 

Boron forms compounds in which the 
element exhibits bonding patterns and reactiv- 
ities not found for other elements. Indeed, two 
Nobel prizes for chemistry have been awarded 
for discoveries involving this property of 
boron. In 1976, William Lipscomb received the 
award for his work on the structure of boranes 
(boron-hydrogen compounds), which illumi- 
nated problems of chemical bonding’. Three 
years later, Herbert Brown was recognized for 
his research into organoboron compounds 
(sharing the prize with Georg Wittig for his 
related groundbreaking discoveries in the 
area of phosphorus chemistry)’. A third Nobel 
prize may also be partly ascribed to discover- 
ies in boron chemistry: in 2010, Akira Suzuki 
received the chemistry award for his studies 
of a carbon-carbon bond-forming reaction 
that uses boron compounds, together with 
Richard Heck and Ei-ichi Negishi for their 
work on other carbon-carbon bond-forming 
reactions’, 

What makes boron compounds so peculiar? 
Boron atoms have three valence electrons, 
which can create three bonds by forming pairs 
with electrons from monovalent groups (R). 
The resulting borane (BR;) molecules thus 
have six valence electrons at the boron atom. 
This is two electrons short of a full valence shell 
of eight electrons, the number required to give 
a stable molecule according to the octet rule 
for main-group atoms. Boranes are therefore 
Lewis acids, compounds characterized by a 
‘vacancy’ in the valence shell — an absence of 
a pair of electrons. This vacancy is associated 
with an empty molecular orbital at the boron 
atom (Fig. 1a). 

Lewis acids typically bind strongly to Lewis 
bases such as amines (NR;), in which the 
nitrogen atom has a ‘lone pair’ of electrons 
in a molecular orbital. The vacancy in the 
valence shell of Lewis acids can be filled by the 
lone-pair molecular orbital of Lewis bases, 
yielding a donor-acceptor bond (Fig. 1b). 
The resulting compounds are well known 
to chemists. 

In 2012, a theoretical study” suggested that 
boron could take part in a different kind of 
chemical bonding that exhibits an unusual 
type of donor-acceptor interaction. This 
involves borylene compounds (BR), in which 
one boron electron is engaged in a B—R bond 
and the remaining two electrons form a lone- 
pair orbital (Fig. 1c). This electronic struc- 
ture leaves two vacant molecular orbitals at 
boron. In other words, the borylene is a dou- 
ble Lewis acid (it has two vacancies), but also 
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Figure 1 | The electronic structure of boron 
compounds. a, Borane compounds, BR;, have 

a ‘vacancy’ — an absence ofa pair of electrons 
(empty square) — in their shell of valence 
electrons. R represents any group that can donate 
a single atom for bond formation. b, Lewis bases, 
such as tertiary amines (NR;), have a ‘lone pair’ of 
electrons (filled square), which they can donate 

to BR; to fill the vacancy, forming a complex. c, 
Borylene compounds have two vacancies and a 
lone pair. d, Borylene complexes can therefore 
form when two Lewis-base ligands (L) donate their 
lone pairs to the borylene. Braunschweig et al.’ 
report a borylene complex in which the boron 
atom binds two carbon monoxide ligands. 


a Lewis base (one lone-pair molecular orbital). 
Such borylene species could be stabilized by 
donor-acceptor interactions with two Lewis 
bases, forming complexes in which the boron 
atom retains its lone-pair orbital (Fig. 1d). This 
bonding situation was proposed for a (BR)L, 
molecule’ in which Lisa cyclic(alkyl) (amino) 
carbene (CAAC) ligand — a very strong Lewis 
base’, 

The theoretical study’ predicted that the 
analogous carbonyl complex (in which L is 
CO) would possess donor-acceptor bonds that 
are weaker than those of the CAAC complex, 
but strong enough to be synthesized. Braun- 
schweig et al. have now verified that prediction 
by making the borylene dicarbonyl complex 
(BR)(CO),, in which R is a bulky substituent. 
The authors also report unusual geometries 
and reactivities of related borylene complexes 
in which L is an isocyanate ligand, CNR. 

The researchers observe that the structure 
and reactivity of (BR)(CO), mimics the typical 
behaviour of transition-metal carbonyl com- 
plexes, which is unheard of in the chemistry 
of main-group elements. For example, when 
the team irradiated the complex with light, 
free BR was generated, and it immediately 
reacted with available substrates. Furthermore, 
the B—CO bonds in the dicarbonyl complex 
are significantly shorter than that of a related 
compound’ in which one of the CO ligands has 
been replaced by CAAC. The shorter bonds 
indicate strong m backdonation — a process in 
which the boron atom’s lone pair of electrons is 
donated to vacant molecular orbitals (known 
as 1* orbitals) of the CO ligands, and which is 
a typical feature of transition-metal carbonyl 
complexes. 

The isolation of the remarkably stable 


© 2015 Macmillan Publishers Limited. All rights reserved 


dicarbonyl complex (BR)(CO), also casts light 
ona controversy about the nature of bonding 
in compounds in which a main-group atom 
forms fewer bonds to other atoms than is nor- 
mal*"' (low-valent compounds). The boron 
complexes (BR)L, have the same electronic 
structure as the carbon compounds CL,. For 
example, carbon suboxide was reported” 
in 1906, and was first thought to be a linear 
molecule containing sequential double bonds 
(O=C=C=C=O). But later experiments” 
showed that the molecule is bent at 156° at 
the central carbon atom. It was therefore pro- 
posed“ that the compound is a complex of a 
bare carbon atom with two carbonyl ligands, 
C(CO),, and thus exemplifies a previously 
unrecognized class of compounds called 
carbones, in which carbon is in the zero oxida- 
tion state. Further examples have since been 
synthesized’*’*. The isolation of (BR)(CO), 
shows that related boron compounds are also 
quite stable, and helps to support the argument 
that carbones are complexes of bare carbon 
atoms with ligands. 

The synthesis of Braunschweig and 
colleagues’ dicarbonyl complex is just the 
starting point for further experimental and 
theoretical work in the field of low-valent boron 
complexes, and it will build a bridge between 
the chemistry of main-group compounds and 
that of transition-metal complexes. Consider- 
ing the ubiquitous use of transition-metal com- 
pounds as catalysts in industrially important 
reactions, the authors’ findings might also open 
up fresh opportunities in applied chemistry. m= 
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Pacific western boundary currents and 


their roles in climate 


Dunxin Hu’, Lixin Wu’, Wenju Cai?’?, Alex Sen Gupta‘, Alexandre Ganachaud’, Bo Qiu’, Arnold L. Gordon’, Xiaopei Lin?, 
Zhaohui Chen”, Shijian Hu, Guojian Wang’, Qingye Wang’, Janet Sprintall®, Tangdong Qu’, Yuji Kashino”®, 


Fan Wang! & William S. Kessler" 


Pacific Ocean western boundary currents and the interlinked equatorial Pacific circulation system were among the first 
currents of these types to be explored by pioneering oceanographers. The widely accepted but poorly quantified 
importance of these currents—in processes such as the El Nifio/Southern Oscillation, the Pacific Decadal Oscillation 
and the Indonesian Throughflow—has triggered renewed interest. Ongoing efforts are seeking to understand the heat 
and mass balances of the equatorial Pacific, and possible changes associated with greenhouse-gas-induced climate 
change. Only a concerted international effort will close the observational, theoretical and technical gaps currently 


limiting a robust answer to these elusive questions. 


estern boundary currents (WBCs) are swift, narrow oceanic 
\ \ | currents found in all major oceanic gyres. Within the Pacific 
Ocean, the subtropical gyre WBCs are the Kuroshio Current 
in the Northern Hemisphere, and the East Australian Current (EAC) in 
the Southern Hemisphere (Fig. la). The Pacific low-latitude, tropical 
belt WBCs include the Mindanao Current in the Northern Hemisphere 
and the New Guinea Coastal Undercurrent (NGCUC) south of the 
Equator, both of which are directly connected to the equatorial Pacific 
circulation system. Much of modern wind-driven ocean circulation 
theory was derived from a quest to understand these Pacific Ocean 
currents. Knowledge of the effect of the Earth’s rotation on WBCs, 
and of the Ekman transport (see Box 1), led to ground-breaking 
advances: that wind stress (Fig. 1b) is a driving agent of ocean currents, 
but it is the horizontal gradient rather than the absolute strength that is 
important; that latitudinal gradients in the effect of Earth’s rotation on 
the horizontal motion cause a flow intensification towards the west of 
the ocean basins; and that the Pacific wind-forced ocean currents 
include the equatorial current system, the low-latitude and subtropical 
WBCs. 

Impacts of the Pacific WBCs on the global ocean circulation and 
climate variability are manifold. First, in winter, the subtropical 
WBCs are associated with the largest supply of heat and moisture into 
the atmosphere in the Pacific basin, and are coupled to North Pacific 
storm tracks’. As cold and dry air comes into contact with the warm 
water carried poleward by the subtropical WBCs, heat and moisture are 
extracted from the surface. Second, the Indonesian Throughflow (ITF), 
the only low-latitude inter-ocean current, flows from the Pacific to the 
Indian Ocean. The throughflow plays a role in the return branch of the 
global thermohaline circulation’. The ITF sources its water mostly from 
the North Pacific Ocean via the Mindanao Current with the remainder 
sourced from the South Pacific Ocean via the New Guinea Coastal 
Current (NGCC), implicating these Pacific WBCs as important 
players in the global climate system. Third, the Pacific is home to the 


El Nino/Southern Oscillation (ENSO), the most prominent source of 
global climate variability on interannual timescales, which severely dis- 
rupts global weather patterns, affecting ecosystems, agriculture, tropical 
cyclones, drought, bushfires, floods and other extreme weather events 
worldwide’. The Pacific low-latitude WBCs are agents for transporting 
mass into the equatorial Pacific, therefore critically influencing the 
Western Pacific Warm Pool (a region of sea surface temperatures 
(SSTs) warmer than 28.5 °C) and the life cycle of ENSO, as well as the 
East Asian monsoon and the Indian/Southeast Asian monsoon. Finally, 
the EAC, through its outflow to the Indian Ocean, participates in the 
Southern Hemisphere supergyre circulation that links the three subtrop- 
ical gyres*®, providing a subtropical gateway for the Pacific’s influence 
on global climate. 

Research and observations over the past 15 years have enabled a string 
of advances in our understanding of the structure of individual compo- 
nents of the Pacific WBCs, interactions amongst them, and their 
climatic impacts. In particular, several multi-national programmes 
are underway. Included in these are the Northwestern Pacific Ocean 
Circulation and Climate Experiment’, the Southwest Pacific Ocean 
Circulation and Climate Experiment*, and the ITF monitoring pro- 
gramme’, providing an intensive observational focus in Pacific 
low-latitude regions. These process studies, together with targeted mod- 
elling and recent advances in broad-scale sampling of ocean temper- 
ature and salinity, upper oceanic currents, winds, precipitation and sea 
surface heights, as well as satellite retrievals, have led to a significant 
advance in our understanding of the physical structure and dynamics 
of the Pacific WBCs, and their possible changes under greenhouse 
warming. 

Here we review the current state of understanding of the structure and 
variability of the Pacific WBCs, their climatic impacts, and how they 
may be affected by greenhouse warming. We show that the entire Pacific 
WEC system moves northwards or southwards concurrently, on sea- 
sonal and interannual timescales. The climatic impacts of the Pacific 
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Figure 1 | Pacific Ocean circulation and boundary currents. a, Schematic of 
major currents and observed surface wind vectors (orange dashed lines) and 
wind stress curl (shading: positive curl in red; negative in blue). The Intertropical 
Convergence Zone and the South Pacific Convergence Zone are indicated by 
thick grey-dashed lines. Red and blue shading in the Southern Hemisphere 
corresponds to anticyclonic and cyclonic curls, respectively, and vice versa in the 
Northern Hemisphere. b, Pacific zonally averaged zonal wind stress (grey curve) 
and wind stress curl (red curve) for the present day (solid lines) and the latter 
half of the twenty-first century (dashed lines). Projected changes are calculated 
as the multi-model mean difference between 2050-2100 future winds under 
RCP8.5 and 1900-2000 historical winds. OC, Oyashio Current; KC, Kuroshio 
Current; NEC, North Equatorial Current; MC, Mindanao Current; NECC, 
North Equatorial Counter Current; NEC, North Equatorial Current; EUC, 
Equatorial Undercurrent; NGCUC, New Guinea Coastal Undercurrent; ITF, 
Indonesian Throughflow; SECC, South Equatorial Counter Current; SEC, South 


WBCs are far-reaching, and are exerted in many ways, including 
through interactions with the South China Sea circulation, the 
Tropical Pacific Warm Pool, and the ITF; through longevity of SST 
anomalies along the WBCs and their feedback to the atmosphere; and 
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Equatorial Current; EAC, East Australian Current; TO, Tasman Outflow; ACC, 
Antarctic Circumpolar Current; GS, Gulf Stream. c, d, Topographic and 
boundary current systems in the Northwest Pacific (c) and Southwest Pacific 
(d), as indicated by the dashed white boxes in a. Depths shallower than 2,000 m 
are colour-shaded. Grey shading indicates depths shallower than 50 m. The 
main currents, integrated from 1,000 m to the surface, are indicated by light blue 
arrows; the red curved arrow indicates the ‘freshwater plug’; red straight arrows 
indicate the main surface-trapped counter currents; purple arrows indicate 
undercurrents; question mark indicates uncertainty in flow path. Panel c is 
adapted from ref. 8 (John Wiley and Sons). LUC, Luzon Undercurrent; MUC, 
Mindanao Undercurrent; NGCC, New Guinea Coastal Current; NEUC, North 
Equatorial Undercurrent; NICU, New Ireland Coastal Current; NVJ, North 
Vanuatu Jet; NCJ, North Caledonia Jet; SCJ, South Caledonia Jet; NQC, North 
Queensland Current; GPC, Gulf of Papua Current; TF, Tasman Front; EAUC, 
East Auckland Current; TO, Tasman Outflow. 


through contribution to the global thermohaline circulation. Under 
greenhouse warming, the response of Pacific WBCs is uncertain, 
although changes are not expected to resemble anomalies associated 
with climate variability. 
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BOX | 
Processes associated with the 
Pacific surface WBCs 


Ekman transport and oceanic currents. In the tropical Pacific, the 
main currents, westward flowing over the upper ocean, are forced by 
the easterly trade winds. In the first few tens of metres of water depth, 
immediately away from the Equator, deflection by the rotation of the 
Earth occurs to the right in the Northern Hemisphere, and left in the 
Southern Hemisphere. This ‘Coriolis’ force causes ‘Ekman transport’, 
whereby tropical surface waters move poleward in both hemispheres. 
Likewise, poleward of 25° N and 25° S, the prevailing westerly winds 
induce Ekman transports towards the Equator, generating a 
convergence between 15°-30° N and 15°-30° S, with high sea levels 
around 10°-15° N and 10°-15° S, respectively. The sea-level slope 
drives two broad geostrophic westward flows: the North Equatorial 
Current (NEC) and the South Equatorial Current (SEC). 

Bifurcation and WBCs. Upon approaching the western boundaries, 
equatorial currents split into two branches: equatorward low-latitude 
WBCs; and poleward subtropical WBCs. In the Pacific, the NEC 
bifurcates into the equatorward Mindanao Current and the poleward 
Kuroshio Current, and the SEC bifurcates into the equatorward Gulf of 
Papua Current and poleward East Australia Current (EAC). 

Rossby wave adjustment. The above ocean wind-current 
relationship assumes a steady wind forcing the ocean for a long time. 
Winds vary on the large scale, and the ocean adjusts by generating 
large-scale anomalies that move in the form of oceanic Rossby waves. 
Such waves propagate along the thermocline, a boundary between the 
warm upper layer and the cold deeper part of the ocean, and always 
propagate westward because of the latitudinal gradient of the 
tangential speed of the Earth’s rotation. 

Wind stress curl. It is not the absolute magnitude of the wind stress 
but rather its curl (the horizontal gradient of the stress) that forces the 
north-south transport of the interior ocean. At some locations, the curl 
is reduced to zero and there is no north-south transport (although 
there might be flows at depth which cancel when summed up). Lines 
along which the curl is zero provide natural boundaries that separate 
circulation into gyres. Such lines in subtropical latitudes are often used 
as an approximation to locate the bifurcation latitude. 

Kuroshio Extension. At some latitude, the Kuroshio Current 
separates from the coast and flows eastward into the ocean interior. At 
this eastward excursion, the Kuroshio Extension has some of the 
largest air-sea fluxes found across the North Pacific basin. It is a region 
with a great quantity of latent, sensible, and net surface heat loss, and is 
co-located with the Pacific storm track. 

Tasman Front and Tasman Outflow. In the South Pacific, part of the 
EAC separates from the coast and veers eastward as the Tasman 
Front; the residual continues southward passing through the Tasman 
Sea. A portion of the Tasman Front attaches to the northern coast of 
New Zealand, forming the East Auckland Current. The portion flowing 
through the Tasman Sea reaches Tasmania and turns westward into 
the eastern Indian Ocean as Tasman Outflow. 


Structure of the Pacific WBC system 


Driven by the large-scale pattern of wind stress curl (see Box 1), the Pacific 
WEC system is characterized by the unique presence of two intense 
equatorward WBCs—the Mindanao Current and the NGCUC. These 
serve as subtropical-to-tropical oceanic pathways that modulate the mass 
and heat balances of the Western Pacific Warm Pool, and the ventilation 
of the equatorial Pacific thermocline. As such, low-latitude WBCs are a 
part of the life cycle of ENSO and Pacific decadal variability'™™’. 

The Mindanao Current and the NGCUC originate at the bifurcations 
of zonal currents arriving at the western boundaries: the westward-flowing 
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North Equatorial Current (NEC) in the North Pacific, and the South 
Equatorial Current (SEC) in the South Pacific. Upon reaching the 
Philippine coast, the NEC bifurcates to feed the poleward Kuroshio 
Current” and the equatorward Mindanao Current’ (Fig. 1c). Ina sim- 
ilar manner, the westward-flowing SEC bifurcates at the Australian 
coast into the northward-flowing Gulf of Papua Current" and south- 
ward-flowing EAC’. The Gulf of Papua Current feeds the NGCUC, 
which exits through narrow straits at the northern boundary of the 
Solomon Sea’, connecting the South Pacific to the Equator and the 
ITF (Fig. Ic, d). 

The convergence of the Mindanao Current and NGCUC near the 
Equator (Fig. 2) feeds the Western Pacific Warm Pool, the surface to 
lower thermocline of the ITF’’, and the eastward-flowing Equatorial 
Undercurrent (EUC)"*. In this way, the two low-latitude WBCs together 
with the strong mixing in the Indonesian Seas largely determine the 
water mass characteristics of the Pacific equatorial current systems 
and of the ITF'*. Also involved in the mass balance of the Western 
Pacific are counter currents, which are associated with the 
Intertropical Convergence Zone and the South Pacific Convergence 
Zone, the two prominent rainbands in the Pacific. These convergence 
zones themselves, together with local winds they have altered and the 
associated wind stress curl, produce two eastward-flowing, surface- 
intensified counter currents, known as the North Equatorial Counter 
Current’? and the South Equatorial Counter Current” (Fig. 1a). 

The Pacific low-latitude current system has a rich vertical structure, 
and over the years there has been conflicting evidence as to the nature of 
some currents”’*°’. In both hemispheres, the bifurcation latitude 
increases with depth. For example, the long-term average NEC bifurca- 
tion latitude is approximately 14° N at the surface, but 20° N at a depth 
of 1,000 m. The Luzon Undercurrent?*”*, observed beneath the 
Kuroshio, with a core at depths around 650 m near 18° N, intensifies 
towards the south before turning to the east to feed the North Equatorial 
Undercurrent. By contrast, despite the suggestion of a northward- 
flowing Mindanao Undercurrent with a centre at around 600 m (refs 21, 
24-26; Fig. 2a), debate persisted as to whether it is transient in nature’, 
a quasi-permanent flow but affected by thermocline variability’, 
or a subthermocline eddy”. The presence of the undercurrent at 
depths below 400 m has been confirmed by recent measurements 
(Fig. 2a; ref. 25). 

In the South Pacific, the range of the SEC bifurcation latitudes as a 
function of depth is large, varying between 13° and 25° S. The surface 
SEC flow turns southward feeding the EAC, while the subsurface flow, the 
Great Barrier Reef Undercurrent (part of the Gulf of Papua Current 
system), veers northward feeding the NGCUC”. The Great Barrier 
Reef Undercurrent, the South Pacific counterpart of the Luzon 
Undercurrent, has been linked to the basin-scale circulation of the sub- 
tropical gyre that contracts poleward with depth*’. The NGCUC was 
recently measured*' as a strong, permanent undercurrent against the 
coast of Papua New Guinea (Fig. 2b), with its core at about 400 m carrying 
the bulk of the South Pacific low-latitude WBC transport that splits in the 
different Solomon Sea straits then flows towards the Equator or the ITF*. 

Because the seasonal wind stress curl variability that controls the 
bifurcation latitude is due to seasonal movement of trade winds, the 
NEC and SEC bifurcations move synchronously and in the same meri- 
dional direction on seasonal timescales. At the surface, the NEC bifurca- 
tion latitude moves from 15° N in boreal summer, to 17° N in boreal 
winter™, in conjunction with the seasonal reversal of the South East 
Asia monsoon. This seasonal movement is accompanied by a stronger 
NEC leading to both a stronger Mindanao Current and Kuroshio 
Current when the bifurcation occurs at lower latitudes in boreal sum- 
mer, increasing the tropical water injection into the North Pacific 
subtropical gyre'*. As such, the NEC, the Mindanao Current and the 
Kuroshio Current transport east of Luzon are all at their seasonal max- 
imum during boreal summer (June-July) when bifurcation occurs 
at the southernmost latitude*** but minimum during boreal winter 
(November-December) when bifurcation occurs at the northernmost 
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Figure 2 | Characteristics of the two equatorward low-latitude WBCs in the 
tropical Pacific. Such WBCs are a unique feature of the Pacific Ocean. 

a, Moored current meter data at 8° N, 127° E showing evolution of meridional 
velocities, the Mindanao Current (blue, negative values) and its undercurrent 
(red, positive values) from boreal winter 2010 to boreal winter 2012, adapted 

from ref. 26 (John Wiley and Sons). Shown are daily ADCP data smoothed bya 
three-day running mean filter. Moorings were anchored in 6,100 m deep water, 
with two 75K RDI ADCP current meters at ~400 m scanning up and down, 


latitude. The SEC bifurcation shifts northward towards the Equator in 
austral summer (November-December), and is southernmost in austral 
winter (June-July)'°. When the SEC is closer to the Equator, its transport 
strengthens, feeding more water into both the EAC and the NGCUC”. 
The EAC attains a maximum transport (up to 36 Sv; 1 Sv = 10° m*s_') 
and extends furthest poleward in austral summer with a large Tasman 
Outflow into the Indian Ocean. In austral winter, the EAC transport 
weakens (to 27 Sv), and much of this veers eastward as the Tasman Front 
(Fig. 1d; ref. 36). 

Consistent with dynamical argument, observations have shown that 
the South Pacific subtropical gyre and the Tasman Outflow south of 
Australia are part of a system of nested anticyclonic gyres encompassing 
the entire Southern Hemisphere subtropics, connecting the Atlantic, 
Indian and Pacific basins to the global thermohaline circulation***”*. 
The Atlantic and Indian Oceans are connected south of Africa by a 
westward Agulhas leakage and the eastward South Atlantic Current”, 
while the Pacific and Indian Oceans are connected south of Tasmania by 
the westward Tasman Outflow**”’, a narrow boundary flow of the west- 
ern Pacific gyre that turns westward around the south of Tasmania and 
‘leaks’ to the Indian Ocean basin*. Tasman leakage occurs primarily as a 
subsurface or intermediate water circulation’, in contrast to the ITF 
route that largely occurs in the upper 300 m. The linkage of the 
Tasman leakage to the supergyre provides a mechanism whereby 
Antarctic Intermediate Water, which transits to the southwest Pacific, 
is distributed between the ocean basins before it spreads northward into 
the Pacific, Indian and Atlantic oceans** 


Variability of the Pacific WBCs 


Pacific WBCs are subject to strong variability on intraseasonal’*’”””, 
interannual?’*’, decadal and longer timescales***’. However, as a 
result of possible eddy aliasing“, and the discrete short-term nature 
of in situ surveys that may be difficult to place in an appropriate climate 
context, there is large uncertainty in the volume transport estimate of 
these WBCs. For example, the Mindanao Current transport ranges from 
13 to 39 Sv depending on the study’??”*". 
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respectively. The current meters move vertically in the water column, as strong 
currents interact with the mooring system. b, Velocity section across the South 
Solomon Sea, from a compilation of glider observations adapted from ref. 31 
(AMS). Units are cm s_!. Red shading indicates equatorward flow, and blue 
poleward flow. The NGCUC is characterized by a strong core, centred at about 
400 m, against the coast of Papua New Guinea, to the left of the section. The 
x-axis represents an equivalent eastward distance from the coast of Papua New 
Guinea. 


The NEC bifurcation latitude and its partitioning into the Mindanao 
Current and the Kuroshio Current are affected by many factors, includ- 
ing local monsoonal winds and buoyancy fluxes, and remote forcing 
from the Pacific interior and North Pacific along the western bound- 
ary”. The main forcing, though, is ENSO. The response of the WBCs 
to ENSO, in turn, influences the life cycle of ENSO. As an El Nifo 
develops, the entire Pacific circulation system ‘breathes’ together, bifurc- 
ating at a more northerly latitude (Fig. 3a, b), as manifested through a 
positive correlation of ENSO with the north and south bifurcation lati- 
tudes. Accompanying the more northerly bifurcation are a stronger 
NEC and North Equatorial Counter Current, a stronger Mindanao 
Current, a more intense EUC at its westernmost part, but a weaker 
Kuroshio Current and ITF**°*° (Fig. 3a). To the south, the NGCUC 
intensifies during, or several months after, the peak of an El Nino’, and 
the surface southward NGCC disappears and turns northwards 
(Fig. 3b), as occurred from July to February during the 1997-98 
extreme El Nifo*’. Although the EAC transport displays variability 
on interannnual timescales, only a weak ENSO signal is evident in 
observations”®. 

The responses of the Mindanao Current and the NGCUC to ENSO as 
described above are integral to the ENSO discharge/recharge cycle*®. 
The increased equatorward WBC transport during an El Nino creates 
a strong confluence of relatively cooler water in the tropical western 
Pacific, and constitutes a compensatory flow for the associated discharge 
of the equatorial Pacific warm water, conducted in the interior Pacific 
through the Sverdrup process, whereby mass is pumped out of the 
equatorial west Pacific in response to wind forcing. 

Recent observations revealed a rich spatial structure and a large res- 
ponse of the SEC transport to ENSO*”’. Variability of the NGCUC 
inflow to the Solomon Sea can reach 100% of the mean transport, with 
the changes mainly occurring in the upper 250 m. During an El Nifo 
event, the SEC transport between New Caledonia and the Solomon 
Islands increases*"*"*, particularly in the North Vanuatu Jet, with an 
enhanced transport entering the Solomon Sea (Fig. 3b). This leads to an 
increased equatorward mass transport through the Solomon Strait and 
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Figure 3 | Impact of El Nifio on the Pacific WBC system. a, b, During an El 
Nii, the bifurcation moves north in both the Northern (a) and the Southern 
(b) Hemispheres, as indicated by the black arrows. The ocean depth is indicated 
as on Fig. Ic, d. Increasing (green boxes) and decreasing (orange boxes) flow 
anomalies during El Nifo are indicated. Some currents are enhanced, while 
others are weakened, as indicated by the anomalies (large dark blue/red 
arrows). From the developing phase to approximately three-to-four months 
before the peak of an El Nijfio, the Kuroshio Current (KC) decreases while the 
Mindanao Current increases. The ITF weakens as the Pacific-Indian pressure 


through the New Ireland Coastal Undercurrent, in addition to the 
increased transport via the Vitiaz Strait” (see Fig. 3b). The transport 
of heat and salt through the Solomon Sea can change by up to a factor of 
two between extreme El Nifio and La Nijfia conditions”. 

Connections between the Pacific WBCs and the ITF are complex, 
particularly when ENSO is considered. Synthesis of observational data 
and model simulations indicates that the depth and velocity of the 
ITF core vary with ENSO, with the ITF slowing and shoaling 
during El Nifio events**. Modelling and observational studies indicate 
that, on average, the proportion of the ITF water derived from the 
North and South Pacific depends sensitively on ENSO phases. 
Climatologically, approximately 80% of the ITF originates from the 
North Pacific and 20% from the South Pacific via the NGCUC*™°. As 
a La Nira develops, the ITF increases due to stronger easterly trade 
winds, which induces a pressure gradient between the West Pacific 
and the eastern Indian Ocean. Climate models suggest a greater 
amount of ITF water originates from the South Pacific during La 
Nifia compared to during El Nifio*. During the development of La 
Nifia, a seasonal and ENSO-dependent inflow from the Luzon Strait 
to the Indonesian Seas forced by large-scale winds in the Pacific, 
called the South China Sea Throughflow*!, weakens. Further, the 
Mindanao Current transport decreases, though a greater portion of 
this current goes into the ITF at the expense of the North Equatorial 
Counter Current®. 

Interactions between the Pacific WBCs and the Pacific Decadal 
Oscillation (PDO) occur on decadal timescales in a manner somewhat 
different to the interannual timescales for ENSO, particularly in the 
North Pacific”. The post-1990 cold PDO phase is linked to a south- 
ward shift of the NEC bifurcation latitude, and a strengthened NEC and 


Bottom depth (m) 


gradient reduces. The South China Sea Throughflow intensifies, setting up a 
freshwater plug. Towards the peak of an El Nifo, equatorward transport 
enhances through NICU and NGCUC. LUC, Luzon Undercurrent; MUC, 
Mindanao Undercurrent; NGCC, New Guinea Coastal Current; NEUC, North 
Equatorial Undercurrent; NICU, New Ireland Coastal Current; NVJ, North 
Vanuatu Jet; NCJ, North Caledonia Jet; SCJ, South Caledonia Jet; NQC, 
North Queensland Current; GPC, Gulf of Papua Current; TF, Tasman Front; 
EAUC, East Auckland Current; TO, Tasman Outflow. 


North Equatorial Counter Current, as a result of intensified easterlies 
over the Western Pacific’. Both the Kuroshio Current and the 
Mindanao Current have also strengthened, inconsistent with ENSO- 
induced changes, which would imply a decreased Mindanao Current. 
In the Southern Hemisphere, observations and ocean reanalysis suggest 
that a cold phase of the PDO is associated with a deeper southward EAC 
extension”®, while the Tasman Front weakens, which suggests that there 
is a gating between these two currents”. In the two decades before 1993, 
the PDO was in a warm phase, whereby the trade winds across the 
tropical Pacific were weaker, consistent with a weaker atmospheric 
Walker circulation’’. In response, the NEC and its counter currents were 
also weaker and shifted northward. 

Multidecadal fluctuations can also modulate the interannual relation- 
ship between ENSO and the NEC bifurcation latitude, such that the 
relationship between ENSO and the northwestern Pacific WBCs is 
non-stationary’**. The NEC mean bifurcation latitude is situated fur- 
ther to the south during a cold PDO phase, and from this more south- 
ward position the NEC bifurcation latitude can ‘jump’ a considerable 
distance to the north during an El Nifio. As such, interannual NEC 
bifurcation variability is more strongly correlated to the Nifio-3.4 index 
during the cold phase of the PDO. Thus, the bifurcation latitude of the 
NEC which is determined by wind forcing in the 12°-14° N band also 
contains variability not solely represented by the Nifio-3.4 index”. 


Interactions with climate 

The Pacific WBCs interact with physical climate on synoptic, interann- 
ual and long-term climate scales, and such interactions are better under- 
stood for the North Pacific than for the South Pacific. In boreal winter, 
strong advection of warm water by the Kuroshio Current into regions of 
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colder air temperatures, such as the Kuroshio/Oyashio extension region, 
results in large losses of latent and sensible heat to the atmosphere™*”’. 
This heat loss triggers oceanic and atmospheric deep convection, and 
fuels storms for their recurrent development, while contributing to the 
formation of mode waters*. In the mixed water region north of the 
Kuroshio Extension, turbulent heat fluxes of more than 600 W m~” 
are observed during wintertime northerly wind events”, where the con- 
trast between the WBC warm core and adjacent waters gives rise to 
particularly large SST gradients: more than 10°C over 200 km across 
the Oyashio Front. The presence of such oceanic fronts can generate 
atmospheric instability, and increased vertical momentum exchange, as 
well as winds induced by changes in air density’’**. This tends to exert a 
positive feedback, whereby larger turbulent heat fluxes from the ocean to 
the atmosphere over the warm core of the current generate even stronger 
winds*”**. Atmospheric eddy heat fluxes associated with synoptic sys- 
tems act to reduce the sharp meridional air temperature gradients’, but 
the large thermal inertia of the ocean mixed-layer and oceanic advection 
rapidly restore the SST front and the air temperature gradient”. An 
impact of the warm water maintained in this manner is that typhoons 
tend to strengthen as they pass over the warm core of the Kuroshio 
Current. During summer time, heat and moisture supply from the 
warm Kuroshio, though less than in winter, is effective in retaining 
the convectively unstable stratification of low-level northwestward 
monsoonal airflow from the tropics, enhancing the convective heavy 
rainfall along the Kuroshio®. 

Beyond synoptic scales, the North Pacific WBCs exert a considerable 
impact on Asia summer monsoons™**. One such impact is on the South 
China Sea summer monsoon, which marks the commencement of the 
Asia summer monsoon. In post-La Nina boreal spring and early sum- 
mer, the western tropical Pacific heat content is anomalously high, as a 
part of the heat recharge associated with stronger trade winds. In asso- 
ciation with a stronger Walker circulation, convection over the eastern 
Indian Ocean and the Western Pacific remains strong, resulting in 
stronger-than-normal westerlies over the tropical Indian Ocean deliver- 
ing the crucial moisture source favourable for the South China Sea 
summer monsoon®’. Consequently, the South China Sea summer mon- 
soon commences earlier than normal, leading to droughts along the 
middle-lower Yangtze River basin and floods in southern and northern 
China*®!. Thus, the heat content of the Western Pacific Warm Pool can 
be a predictor of the South China Sea summer monsoon onset and 
strength, with a high heat content associated with an early onset and 
strong intensity. Conversely, in the late boreal spring-early summer 
after an El Nifio, when the warm pool is still cooler than normal, the 
South China Sea monsoon takes place later than normal, causing floods 
along the middle-lower Yangtze River basin and drought in southern 
and northern China®*. A similar link exists between the Australian 
summer monsoon and ENSO via SST anomalies in the central and 
Western Pacific™; however, the timing is different. The Australian sum- 
mer monsoon tends to be weakened, concurrently, by an El Nifio, and 
strengthened by a La Nina. 

Interactions between the North Pacific WBCs and the South China 
Sea Throughflow® on seasonal and interannual timescales constitute 
another source of climatic influence. During the boreal winter, when 
the northwest monsoon prevails over the Indonesian Seas, or during El 
Nino periods, the wind drives buoyant, low-salinity surface water 
derived from the South China Sea via the Karimata Strait (Fig. 1c) into 
the southern Makassar Strait and from the Sibutu Passage into the 
northern Makassar Strait*®. This creates an anomalous pressure gradient 
in the surface layer of the Makassar Strait (red curved arrow, Fig. 1c) that 
inhibits the warm surface water from the Mindanao Current from enter- 
ing into the ITF. This surface-layer ‘freshwater plug’, originally due to 
heavy rainfall and large runoff from Southeast Asia, inhibits the Pacific 
warm surface water from flowing southward into the Indian Ocean, 
leading to a reduced, cooler ITF in the Makassar Strait during 
December-January and during El Nino, and inducing a cooler upper 
Indian Ocean, which may in turn weaken the Asian monsoon”. During 
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the boreal summer, when winds in the Indonesian Seas reverse to the 
southeast monsoon, or during La Nifa periods, the obstructing pressure 
gradient is reduced, reducing the impact of the freshwater plug, support- 
ing an earlier and stronger South China Sea monsoon in the late spring, 
or early summer after a La Nifa event”®. 

Variations in the North Pacific WBCs also exert a climatic impact 
through the longevity of SST anomalies they generate****°°”°, influ- 
encing the large-scale atmospheric circulation and storm-track activ- 
ities, particularly near the multiple fronts of the WBC extension’”!”’. 
These extension fronts are fertile grounds for strong SST variability. 
Because of the strong horizontal gradients, a slight frontal movement 
can lead to a large change in SSTs and is an effective process for gen- 
erating SST variability. Frontal movement can be generated by 
intrinsic variability of the WBCs as well as by remote ocean and atmo- 
sphere forcing via extratropical ocean Rossby waves”~”°. The Kuroshio 
Extension fronts, for example, are strongly influenced by large-scale 
changes in the subtropical gyre driven by wind stress changes across 
the North Pacific’*”*, and by ENSO-induced basin-scale wind stress curl 
anomalies through atmospheric teleconnections”*, emanating from 
the tropical Pacific, remotely forcing SST variability in the Kuroshio 
Extension region via oceanic Rossby waves’*”*. 

The longevity of SST anomalies along the Kuroshio Extension can be 
further enhanced by processes that either maintain or reinforce the 
anomalies’ *°. For example, in boreal summer, the North Pacific 
Ocean cools the warm moist air advected from the south, inducing 
the formation of low-level stratus and fog within the boundary layer 
on the northern sides of the Kuroshio Extension fronts’””*. These clouds 
(in particular low clouds) reduce solar insolation because of their high 
albedo, in turn maintaining low SSTs”°”’. This positive ‘cloud-SST feed- 
back’ acts to reinforce cold SST anomalies, and can enhance and prolong 
summertime cold SST anomalies near the North Pacific subarctic front 
zone’®. In boreal winter, through anomalous geostrophic heat advection, 
the axial migration of the Kuroshio Current and the associated frontal 
movement generate subsurface temperature anomalies that can persist 
over multiple years. Shoaling of the North Pacific mixed layer in boreal 
spring and deepening in autumn results in mid-latitude SST anomalies 
recurring from one winter to the next without persisting through the 
intervening summer’”*°. This ‘re-emergence mechanism’ operates when 
SST anomalies spread throughout the deep winter mixed layer, but 
remain beneath the mixed layer in spring, isolated from the ‘damping’ 
surface fluxes. When the mixed-layer deepens again in the following 
autumn, the temperature anomalies are brought out into the surface 
layer and influence SST*. Along the Kuroshio Extension, advection 
by the mean currents transports these anomalies by as much as 4,000 
km over the course of a year”*°. These processes are favourable for 
decadal SST variability such that it is particularly strong along such 
fronts??*", 

Climatic impacts of the EAC are less well understood, but there is 
suggestion of an influence on Australian and New Zealand climates. 
Weaker EAC transport is linked to anomalous cool conditions in New 
Zealand®, and a stronger EAC is shown to promote rainfall along the 
east Australia coast*’. However, little is known about the impact of the 
EAC on, for example, storm tracks or atmospheric circulations. 

The Pacific WBCs exert substantial influence on timescales longer 
than decadal, and on climate remote from the Pacific regions, through 
their linkage to the Southern Hemisphere supergyre circulation via the 
Tasman Outflow**** and the ITF’*****’”. The Tasman Outflow influ- 
ences downstream ocean climate through intermediate-depth water‘, 
but the influence through the ITF is conducted through the upper 
300 m (ref. 9). Increased transport within the warmer upper layer of 
the ITF modifies the heat budget of the Indian Ocean***? as well as the 
residence time of tropical Indian Ocean waters*®, thermocline depth and 
SST patterns*’. Further downstream, the ITF feeds the Agulhas leakage, 
and ultimately provides an important heat source for supporting the 
global thermohaline circulation’. 
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Observed changes and future projection 


Over the past century, there has been direct and indirect evidence sup- 
porting changes having occurred in the subtropical gyres. For example, 
there appears to be a synchronous surface ocean warming along the path 
of all subtropical WBCs that is two-to-three times greater than the global 
mean**. The enhanced warming is associated with a poleward shift and/ 
or intensification of global subtropical WBCs in conjunction with a 
systematic change in winds over both hemispheres. Specifically, the 
zero-curl line shifted poleward in the North Pacific, the North 
Atlantic and the South Atlantic Oceans; in the South Pacific and 
South Indian Oceans, high-latitude westerly winds strengthened while 
mid-latitude southeasterly trade winds weakened**’, consistent with a 
positive trend of the Southern Annular Mode*”, leading to accelerated 
warming and an intensified supergyre. 

Over the past 60 years, there has been a synchronous long-term 
southward shift of the NEC and the SEC bifurcations®’ in the tropical 
Pacific (Fig. 4a, b), similar to their synchronous movement on inter- 
annual timescales. Over the 60-year period, the overall long-term south- 
ward shift of the NEC is modulated by a slight northward migration 
from 1970 to 1992 (ref. 90) and a southward shift in the post-2000 
period, associated with the warm and cold PDO phases, respectively. 
The long-term southward shift of the latitude of the NEC bifurcation 
suggests a strengthening Kuroshio Current, consistent with the observed 
accelerated warming (Fig. 4a). By contrast, the southward shift of the 
SEC bifurcation has no bearing on the transport partition, but is accom- 
panied by a substantial southward extension of the EAC**’. The relative 
importance of greenhouse warming and decadal variability for the 
North Pacific changes is difficult to quantify. However, about two-thirds 
of the poleward intensification of the Southern Hemisphere supergyre 
circulation, including the southward EAC extension and the strength- 
ening of the Tasman Outflow’'”’, has been attributed to wind changes 
induced by Antarctic ozone depletion”’, and one-third to increasing 
atmospheric CO, (refs 91-93). 
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Figure 4 | Trend in the bifurcation latitude of the North and South 
Equatorial Currents. a, b, Time series of annual mean NEC bifurcation 
latitude integrated over the upper 381 m (a; red curve, adapted from ref. 90 
(John Wiley and Sons)), and of SEC bifurcation latitudes integrated over the 
upper 200 m (b; blue line, adapted from ref. 89 (John Wiley and Sons)), based 
on SODA version 2.2.4. The bifurcation in each layer of SODA data is defined 
as where the meridional velocity averaged within a 2° longitude band off the 
Philippine/Australian coast is zero. Correlation between the NEC bifurcation 
latitude and Nifio-3.4 is r = 0.55, and between the SEC bifurcation latitude and 
Nifo-3.4 is r = 0.40, both statistically significant above the 95% confidence 
level. Both trend lines are above the 95% confidence level. 
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Projected changes in the Pacific WBCs depend on emission scenarios 
of greenhouse gases. Antarctic ozone is projected to recover by around 
the middle of this century, providing a mechanism to offset the impacts 
of a projected continuous increase in CO,. As such, climate models 
under the business-as-usual emission scenario, which currently matches 
the observed emission rates, simulate no further trend in the Southern 
Hemisphere westerly winds before 2045, but an acceleration in the post- 
2045 period”. In the following two paragraphs, we discuss projected 
changes averaged over the 2050-2100 period referenced to the mean 
conditions over the twentieth century, under this emission scenario. 

Climate models consistently indicate that the Tasman Outflow trans- 
port will continue to increase (from ~2 Sv in the twentieth century to 
~6 Sv over 2050-2100) and that the latitude of maximum EAC transport 
will shift further south (~1.5° latitude over 2050-2100; ref. 94); however, 
the projected EAC maximum transport shows little change (Fig. 5). In the 
North Pacific, the Kuroshio Current is projected to weaken at its origin by 
approximately 10%, although the maximum Kuroshio transport shows 
little change. The EUC tends to strengthen in the eastern and central 
Pacific (Fig. 5) and become shallower, particularly in the west”*”®. This 
tendency for strengthening occurs despite a weakening of the equatorial 
trade winds”. The projected EUC intensification is driven by a projected 
strengthening of the NGCUC that results from a wind stress curl change 
that is negative, associated with a weakening of the equatorial trade winds 
and a strengthening of the southeasterly trade winds”®. A strengthening of 
the NGCUC supporting a strong EUC is consistent with a present-day 
observation that much of the NGCUC transport goes to support the 
EUC”, although the range in model projections is rather large. 

Both the Mindanao Current and the ITF are projected to weaken 
(Fig. 5). The reduced ITF is not fully accounted for by the projected 
wind-induced changes through Sverdrup dynamics that exclude an 
influence from upwelling deep water. This exclusion is an invalid 
assumption because such upwelling is important on timescales relevant 
for climate change. It is conceivable that a projected slowdown of the 
global thermohaline circulation”* would drive a weakening in upwelling 
of deep water in the South Pacific, contributing to the projected ITF 
decrease, and following the notion that the ITF is in part supported by 
the upwelled deep water associated with the compensating flow for the 
global thermohaline circulation’. 


Uncertain outcome 


There is little doubt that the mean climate over the Pacific will continue to 
change in the coming century owing to past and future emissions of 
greenhouse gases. Climate models suggest that the tropical trade winds 
are likely to weaken*, and that the southern mid-latitude westerlies are 
likely to undergo a poleward shift and intensification’’™, and contribute 
to the intensification of the WBCs, although changes in North Pacific 
winds are less clear”’. These changes, together with a projected weakening 
in the global thermohaline circulation”’, may simultaneously affect the 
Pacific WBCs. Further, such changes may not resemble anomalies assoc- 
iated with variability on interannual or inter-decadal timescales”. 
Although climate models agree on many aspects of the projected future 
changes, such as the weakening of the NEC and the NECC and the 
strengthening of the NGCUC and the NQG, the simulation of WBCs is 
generally poor because these coarse-resolution simulations do not resolve 
the terrain of marginal seas, complex flow structure of WBCs, bathymetry 
and eddies. Certain aspects of future change in high-resolution models 
remain consistent with the lower-resolution models, but uncertainty in 
projected changes remains high. Another challenge is the lack of informa- 
tion about the flow structure and amplitude of the low-latitude Pacific 
WBCs in the real world, necessary for benchmarking climate models. For 
example, despite an extensive observational campaign, the ITF mass bud- 
get is not closed; the estimated outflow exceeds the inflow by ~2 Sv 
(ref. 9). It is unknown whether a contribution from the NGCUC through 
the Molucca and Halmahera Seas might account for the discrepancy. This 
mass imbalance is linked to many long-standing issues, including whether 
Antarctic Intermediate Water, through the NGCUC and the Mindanao 
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Figure 5 | Modelled transports of annual-mean Pacific WBCs and their 
projected changes. a, Historical mean transport averaged over the twentieth 
century (positive values indicate northward, and negative, southward 
transports), and b, projected change from the historical mean, averaged over 
the 2050-2100 period under RCP8.5, for selected Pacific currents (listed at 
bottom). Positive changes of positive mean transport indicate strengthening, 
and vice versa. Shown are individual models (markers; see key at top), multi- 
model mean (horizontal black lines) and multi-model medians (horizontal red 
lines). Values at the top of each column in b indicate the number of models (out 
of a total of 21) that agree on the sign of the change. Based on a binomial 
distribution, agreement of 13, 14, 15 and 16 models (out of 21) corresponds to 
significance levels of 81%, 90.5%, 96% and 99%, respectively. Mean EUC 
transports for IPSL-CM5A-MR/LR and CNRM-CM5 are greater than 90 Sv, 
and are not shown in a. 


Undercurrent, reaches east of the Luzon northern tip’’, or directs east- 
ward between 10°-12° N under the NEC™”*. Without such ground-truth 
information, our confidence in model performance will remain low. 
With a sustained community effort, our understanding of the Pacific 
WBCs and their impacts will continue to improve. The ultimate goal is to 
produce a reliable projection of the Pacific WBCs and their climatic 
impacts that is consistent with our physical theoretical understanding, 
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as well as with what observations show. To this end, coordinated observa- 
tions, studies of processes, and modelling should be bolstered. Our models 
need to resolve the complex terrain of marginal seas and the detailed 
structure of the WBCs, together with bathymetry, mesoscale eddies, and 
other currently unresolved processes, especially turbulent mixing. An 
international WBC intermodel comparison project may offer a good ave- 
nue for a substantial advance. These modelling efforts need to be sup- 
ported by a coordinated observation system integrating high-density 
mooring arrays across all Pacific WBCs, satellite observations, Argo floats 
and platforms that enable near-coast measurements, to provide a three- 
dimensional picture of the coast-to-deep-water WBC structure. In the 
meantime, we need to combine all sources of information to make 
reliable predictions of variability and robust assessments of the response 
of the Pacific WBCs to greenhouse warming and the associated climatic 
impacts. 
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Hippocampal-prefrontal input supports 
Spatial encoding in working memory 


Timothy Spellman!, Mattia Rigotti?*, Susanne E. Ahmari®®, Stefano Fusi?”, Joseph A. Gogos!* & Joshua A. Gordon®? 


Spatial working memory, the caching of behaviourally relevant spatial cues on a timescale of seconds, is a fundamental 
constituent of cognition. Although the prefrontal cortex and hippocampus are known to contribute jointly to successful 
spatial working memory, the anatomical pathway and temporal window for the interaction of these structures critical to 
spatial working memory has not yet been established. Here we find that direct hippocampal-prefrontal afferents are 
critical for encoding, but not for maintenance or retrieval, of spatial cues in mice. These cues are represented by the 
activity of individual prefrontal units in a manner that is dependent on hippocampal input only during the cue-encoding 
phase ofa spatial working memory task. Successful encoding of these cues appears to be mediated by gamma-frequency 
synchrony between the two structures. These findings indicate a critical role for the direct hippocampal-prefrontal 
afferent pathway in the continuous updating of task-related spatial information during spatial working memory. 


Spatial working memory (SWM) is an essential feature of goal- 
directed action. Locating a resource, a threat, or even oneself within 
a dynamic or unfamiliar environment requires a cached representa- 
tion of relevant spatial features that must be continuously updated, 
preserved and applied as needed to the execution of adaptive beha- 
viours'. Despite long-standing interest in the neurobiological under- 
pinnings of SWM, its multiple cognitive components, distributed 
anatomical constituents and distinct temporal phases have rendered 
its underlying circuit mechanisms elusive” °. Nevertheless, an extens- 
ive body of work supports the idea that the prefrontal cortex (PFC) 
has a central role in the successful execution of tasks requiring SWM’. 
Moreover, the joint contribution of the medial prefrontal cortex 
(mPFC) and hippocampus (HPC) supports successful SWM in 
rodents**. It remains unclear, however, which phase(s) of SWM 
(encoding, maintenance, and/or retrieval) require the joint participa- 
tion of the HPC and mPFC, what behaviourally relevant information 
is conveyed between the two structures, and through which anatom- 
ical pathway(s) they interact. 

Direct HPC-mPFC connectivity is confined to a unidirectional 
projection from the CA1/subiculum of the ventral-most two-thirds 
of the hippocampus (vHPC)’***. Cells in both the VHPC and mPFC 
exhibit location-specific firing’®'’, and damage to the vHPC disrupts 
goal-related activity in the mPFC", suggesting that the direct VHPC- 
mPFC projection may transmit critical location information during 
SWM tasks. 

We used a rodent delayed non-match-to-place (DNMTP) task, 
known to require activity in the VHPC and mPFC, to test the role of 
VHPC-mPFC afferents in SWM. We applied a projection-specific, 
optogenetic silencing approach that afforded anatomical and tem- 
poral precision. Inhibiting vHPC-mPFC direct input disrupted 
encoding, but not maintenance or retrieval, of location cues necessary 
for task performance. Furthermore, we found that goal-selective firing 
in the mPFC was dependent on vHPC direct input exclusively during 
the encoding phase of each trial. Finally, we found evidence suggesting 


that the transmission of task-critical information through the VHPC- 
mPFC pathway is mediated by the synchronization of mPFC units 
to gamma oscillations in the VHPC. Together, these findings suggest 
that the direct VHPC-mPFC pathway enables the encoding of salient 
spatial cues during SWM performance. 


Optogenetic inhibition of VHPC terminals 
To interfere specifically with vHPC-mPFC inputs, a projection- 
specific targeting approach was used. An adeno-associated virus 
vector (AAV2/5) engineered to express fluorescently labelled archae- 
rhodopsin (eArch3.0-enhanced yellow fluorescent protein 
(eYFP))”°*? was targeted to subfield CA1 of the vHPC in mice 
(Methods). Robust expression was seen in the dendrites and axons 
of the vVHPC and in projection axons in the mPFC (Fig. 1a). 
Arch-mediated hyperpolarization of distal axon terminals dis- 
rupted synaptic transmission in vivo without affecting spontaneous 
vHPC firing. The effect of terminal illumination on synaptic trans- 
mission was measured in acutely anaesthetized mice. Electrical stim- 
uli were delivered via a bipolar stimulating electrode to ventral CA1 in 
both Arch-expressing (Arch*) mice and Arch-negative controls 
(Arch’). Postsynaptic multi-unit responses were observed in the 
mPFC (Fig. 1b). Light pulses delivered to the mPFC on interleaved 
trials reduced the evoked response by ~40% for Arch* animals but 
not Arch” animals (Fig. 1b, c). To measure the effects of terminal 
illumination on vHPC cell bodies, mice were implanted with 
optical-fibre-coupled stereotrodes in the VHPC and optical fibres in 
the mPFC. In the awake, resting state, light pulses delivered to the 
VHPC reduced local spontaneous multi-unit activity by ~50% 
(Fig. 1d), while illumination of terminal fields in the mPFC had no 
effect on multi-unit activity in the VHPC (Fig. le). These experiments 
demonstrated successful and specific inhibition of terminals in vivo, 
an approach that was then applied to a behavioural paradigm to 
examine what role the VHPC-mPFC projection has in spatial working 
memory. 
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Terminal inhibition impairs encoding 

To assay working memory performance, a T-maze DNMTP task was 
employed. In this task, each trial is divided into three phases (Fig. 2a). 
In the sample phase, one of two goal locations is blocked by a wall, and 
the mouse is directed towards a food reward in the open location; 
during this phase the animal must encode the location of the sample 
goal. In the delay phase, the mouse returns to the start box and must 
maintain the sample goal in working memory during a variable delay 
(Methods). In the choice phase, the wall is removed, and the mouse 
must select the previously closed arm to receive a second reward. 
After successful task acquisition, light was delivered to vHPC- 
mPFC terminal fields in Arch* and Arch” mice during the entire 
trial (Entire Trial condition), the sample phase only (Sample Light), 
or the choice phase only (Choice Light). Trial types were randomly 
interleaved. 


a Two-goal T-maze 
Sample Choice 

WiArch* Mf Arch- 

8 80 
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Figure 2 | Inhibition of VHPC-mPFC terminals impairs encoding. a, Two- 
goal DNMTP task. b, Effect of mPFC illumination on performance in the two- 
goal task differed by trial phase and virus type (ANOVA, virus-by-light 
interaction, F = 5.92, P = 0.02). Post-hoc tests revealed effects of Entire Trial 
(t = 3.96, P = 0.002) and Sample Light (tf = 2.98, P = 0.011) but not Choice 
Light (t = 1.1, P = 0.29) conditions in Arch* (n = 8) versus Arch” (n = 6) 
animals. Entire Trial and Sample Light but not Choice Light performance 
differed from the No Light condition in Arch” (t = 4.9, P = 0.002, Entire Trial; 
t = 4.5, P = 0.003, Sample Light; t = 1.7, P = 0.125, Choice Light) but not 
Arch’ animals (t = 0.18, P = 0.87; t = 1.1, P = 0.3; t = 1.3, P = 0.25, 
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Figure 1 | Optogenetic inhibition of VHPC- 
mPFC terminals in vivo. a, Expression of Arch 
(middle row, green) and mCherry (top row, red) in 
| ventral CA1. Arrow, lesion marking electrode 
location. Bottom row, Arch in terminals in the 
prelimbic (PL) and infralimbic (IL) mPFC. Cg1, 
anterior cingulate cortex. Original magnifications, 
X 2 (left), < 20 (right). b, mPFC multi-unit 
responses to VHPC stimulation in Arch* mice. 
n = 16 sites from 3 animals, ANOVA F = 5.6, 
P = 0.02 for light effect, *P < 0.05, post-hoc t-test. 
Baseline rate = 6.1 + 0.14 Hz. c, Group mean 
evoked mPFC spike rate summed across 5-40 ms 
post-stimulus. ANOVA F = 31.4,P = 4X 10° for 
virus-by-light interaction; n = 16 sites from 3 mice, 
t = 6.68, P = 0.0004; n = 17 sites from 3 mice, 
t = 1.57, P = 0.3, for Arch* and Arch” mice, 
respectively. d, e, Multi-unit activity traces from 
vHPC of Arch* mice during somatic (d) and 
terminal field (e) illumination in vivo. Yellow bar, 
light on (throughout). Error bars show + standard 
error of the mean (s.e.m.). 
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In Arch* but not Arch” mice, performance was impaired in Entire 
Trial and Sample Light conditions; Choice Light did not result in a 
statistically significant impairment (Fig. 2b). These data raised the 
possibility that vVHPC-mPFC input is critical for the encoding of 
location cues associated with the sample goal but may not be required 
for the retrieval of such cues. 

This native version of the T-maze task, however, is not optimized to 
discriminate between encoding and retrieval of the sample goal loca- 
tion, as the animal could begin forming a motor action plan (that is, 
“go into the opposite arm’) any time after it encounters the sample 
goal. To segregate better the encoding, maintenance, and retrieval 
phases within each trial, a modified, four-goal T-maze was con- 
structed (Fig. 2c). Here, as in the two-goal task, a single goal was made 
available for retrieval of the sample reward. During the choice run, the 
sample goal and one of the other three arms were open. This design 
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respectively). c, Four-goal DNMTP task. d, Effect of illumination of mPFC 
terminal fields on performance in the four-goal task (Arch’, n = 7 Arch’, n = 
6, ANOVA F = 3.1, P = 0.03 for virus-by-light interaction). Impairment was 
restricted to Sample Light trials (tf = 3.1, P = 0.0093; and t = 1.1, P= 0.29; f = 
1.0, P = 0.34; t = 1.91, P = 0.08; t = 1.2, P = 0.24, for No Light, Delay Light, 
Choice Light 10 s and Choice Light 20 s, respectively). No Light and Sample 
Light performance were significantly reduced in Arch* (t = 2.5, P = 0.04) but 
not Arch mice (t = 0.35, P = 0.73). Performance of Arch* mice during Sample 
Light runs was not significantly above chance (t = 1.9, P = 0.11). NS, not 
significant. Error bars show s.e.m. 
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prevents the mouse from formulating a spatially directed action plan 
until the choice phase, when, at the end of its centre arm run, it is 
presented with two of the four goals as options (Supplementary 
Video). Thus, selection of the choice goal was temporally restricted 
to the choice phase, separating out encoding from retrieval. 

Again, performance was impaired in the Sample Light condition, 
confirming the requirement for VHPC-mPFC input during encoding 
(Fig. 2d). In the Choice Light condition (repeated here with either 10 
or 20 s delays), a slight trend towards impairment did not reach 
significance, and post-hoc power analysis revealed that an n of 37 
animals per group would have been required to detect a paired dif- 
ference given the observed effect size. There was also no significant 
impairment with terminal illumination during the delay period 
(Delay Light; Fig. 2d). Although we cannot conclusively rule out an 
effect of terminal inhibition during choice, these experiments indicate 
that input from the vHPC to the mPFC is critical for encoding of 
spatial cues. 


Effect on spatial representation in mPFC 


The behavioural findings suggested that task-related spatial locations 
may be represented by firing rates in the mPFC in a VHPC-input- 
dependent manner. Therefore, recordings of multiple single units 
were obtained from the prelimbic region of the mPFC in mice per- 
forming the four-goal task (Extended Data Fig. 1). A total of 792 well- 
isolated single units were obtained from 9 mice. Forty-four per cent of 
mPFC single units displayed selectivity for one or both spatial dimen- 
sions that distinguished goal arms (left/right, back/front), and/or their 
interaction, as assayed by two-way analysis of variance (ANOVA) 
(Extended Data Fig. 2a—d). Given the inhomogeneous and distributed 
nature of the representation of spatial information among recorded 
mPFC units, a maximum margin linear classifier” was used to decode 
the sample goal location from binned population firing rate vectors 
and to quantify the strength and reliability of the neural representa- 
tion (Fig. 3). The classifier was cross-validated by training on data 
from half the trials and testing the model’s performance on data from 
the remainder (see Methods). 

The sample goal was decoded from spike histograms aligned to 
multiple trial events (Fig. 3a). Sample goal identity was decoded from 
the mPFC population at accuracies well above chance from the time 


Decoding goal location: sample run 
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the animals entered the T-intersection, and peaked at 96% upon 
arrival at the reward port in the sample goal (Fig. 3b). 

Sample goal representation at goal arrival time was then assessed 
during inhibition of VHPC input by training the model on firing rates 
from non-Sample Light trials (No Light, Delay Light and Choice Light 
trials combined) and testing on rates from Sample Light trials. For 
Arch” animals, the sample goal was decoded equally well with or 
without mPEC illumination, while in Arch* animals accuracy was 
reduced to chance by mPFC illumination (Fig. 3c). This result demon- 
strates that VHPC input is critical for the representation of the sample 
goal among mPFC units during encoding. 

To assess the impact of terminal inhibition on encoding of non- 
spatial, task-relevant cues, the same classifier was trained to decode 
the task phase (sample versus choice) at the time the start-box doors 
opened immediately before running down the centre arm. Because 
this epoch was behaviourally equivalent in sample and choice runs, an 
accurate representation of the task phase at this time point must rely 
upon a memory of the preceding task phase. Task phase was decoded 
with near perfect accuracy (0.98) at the time bin corresponding with 
the opening of the doors, revealing a memory trace for the preceding 
task phase (Fig. 3d). When the decoder was trained on firing rates 
from this epoch in No Light trials and tested on choice runs from 
Sample Light trials, model accuracy was not affected (Fig. 3e), sug- 
gesting that VHPC-mPFC terminal inhibition does not generally 
interfere with the encoding of task-relevant information. 

The finding of vHPC-input-dependent location coding in the 
mPFC leaves open the question of how vHPC input influences 
mPFC neurons. Overall firing rates of mPFC units were unaffected 
by terminal illumination in both Arch” and Arch* animals, whether 
all were considered together (Fig. 4a) or putative pyramidal cells and 
interneurons were separately classified on the basis of waveform fea- 
tures (Extended Data Fig. 4). Nonetheless, consistent effects of VHPC 
inputs might be revealed by a finer-grained analysis. 

Each unit’s preferred goal (that is, the goal in which firing rate was 
most different from the mean rate across the other three goals) was 
identified using weights generated by the classifier (Methods). 
Although this separation criterion allowed for the possibility that 
any given unit might represent location by an increase or decrease 
in rate, the difference was observed as an elevation in mean firing rate 
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Figure 3 | mPFC units require VHPC input to encode location but not task 
phase. a, Schematic of sample run. b, Accuracy of goal decoding during sample 
run with light off. Solid lines, mean decoder accuracy; shaded areas, 95% 
confidence intervals. n = 727 units from 9 mice. c, Decoding accuracy for 
sample goal upon arrival at the reward port (location ‘d’ in a) in the presence 
(filled bars) and absence (open bars) of illumination of VHPC-mPFC terminals 
(269 units from 4 Arch* mice, 285 units from 5 Arch” mice; n = 100 
permutations; ANOVA F = 1,978, P=5 X 10 ° for virus-by-light interaction; 
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t = 0.48, P = 0.64 for Arch’; *t = 161.2, P= 1.2 X 10"! for Arch’). Error 
bars represent 95% confidence intervals; blue lines represent upper bounds of 
95% confidence intervals for shuffled data. d, Decoding of task phase (sample 
versus choice) as a function of time relative to departure from the start box. 
Conventions as in b (792 units from 9 mice). e, Decoding of task phase at door 
opening (location ‘a’ in a) as a function of trial type (ANOVA, F = 1.94, P = 
0.17 for virus-by-light interaction). Error bars show s.e.m. 
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Figure 4 | Location selectivity requires VHPC 
input during encoding but not retrieval. 

a, Differences in firing rate with terminal 
illumination (n = 433 units, sign rank z = —0.23, 
P= 0.82; n = 359, z 0.97, P = 0.33, for Arch* 
and Arch’ animals, respectively). b, Mean binned 
spike rates for an example mPFC unit from an 
Arch* animal, aligned to arrival at each of the four 
sample goals (colour coded) without (left) or with 
(right) terminal illumination. c, Peri-event firing 


c Sample d SL rates during the sample phase for all location- 

8) arch m Pref. goal 8) Archt im Pref. goal selective units as they approach the preferred (Pref.; 
_ 6 m Other ae m Other red/grey) and non-preferred (Other; black) goals 
= = (blue asterisks, time points with Bonferroni- 
8 ‘ B4 corrected significance) during light on (right) and 
oO oO 
= 2 > 2 light off (left) trials. n = 67 units from 7 Arch* 
= stains = sia pos mice, and 78 units from 6 Arch’ mice. d, Same as 
3° . a , ° R37 7 pas =e 2 5 c, but during the choice phase. n = 145 units from 7 
g 7 sd ‘d EEE Arch” mice, and 77 units from 6 Arch* mice. 
5 Arch Pref. goal E 8) Arch Pref. goal 
=) € @ Other = || @ Other 
to) to} /\ 
c c /\ 
& 4 8 j | 
= = LAW | 

2 Perha 


5 0 5 5 10} 5 -5 0 5 -5 
Time from goal (s) 


relative to other goals, when averaged across all units (Extended Data 
Fig. 5), and especially those units with high goal selectivity (Fig. 4b-d). 
The effect in Sample Light conditions was striking. Inhibition of 
terminals during the sample phase prevented the increase in firing 
rate that occurred in units’ preferred goals, without affecting baseline 
rates (Fig. 4c, top). The same results were obtained when including all 
units in the analysis (Extended Data Fig. 5). These results reveal an 
excitatory role for VHPC input in the location-selective firing rate 
enhancement seen in mPFC units. 

To address whether direct VHPC input is necessary for all mPFC 
spatial representations, firing rate in preferred and non-preferred 
goals was examined during choice runs. As in sample runs, firing rate 
for highly goal-selective units during choice runs was also higher in 
the preferred goal than in non-preferred goals. However, unlike in 
sample runs, this elevation was unaffected by VHPC-mPFC terminal 
inhibition (Fig. 4d). In this finding, physiology mirrored the beha- 
vioural result, indicating that goal selectivity after encoding is no 
longer dependent on vHPC input. 


vHPC gamma organizes mPFC spike timing 


Activity in distant brain regions can be coordinated by gamma oscil- 
lations”, and long-range gamma synchrony in both the cortex and 
hippocampus has been linked to spatial learning and memory~*”. 
Therefore, long-range synchrony was quantified using the magnitude 
(pairwise phase consistency”® (PPC)) and significance (Rayleigh’s 
test”) of phase non-uniformity of spike times in one brain region 
relative to local field potential (LFP) oscillations in another. To deter- 
mine the temporal directionality of synchronous activity, lag analysis 
was performed, in which phase-locking was calculated at various 
temporal shifts; preferential phase-locking at a non-zero lag indicates 
a predictive relationship between oscillatory phase and spike timing. 

A subset of mPFC units were significantly phase-locked to VHPC 
gamma (Fig. 5b, c), and the percentage of significantly phase-locked 
units was greatest at lags in which VHPC gamma preceded mPFC 
spiking (Fig. 5c). Moreover, the mean strength of mPFC unit phase- 
locking to VHPC gamma was maximal at lags in which VHPC led 
(Fig. 5d-f). These findings suggest the possibility that gamma-fre- 
quency inputs from the vHPC influence mPFC spike timing. 
Consistent with this suggestion, inhibition of VHPC-mPFC terminals 
reduced the overall strength of gamma phase-locking (Fig. 5g, h), 
indicating that the observed synchrony is mediated by direct 
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vHPC-mPFC input. Importantly, this directionality was specific to 
gamma oscillations, as lag analysis of phase-locking in the theta range 
revealed an opposite directionality; mPFC led vHPC (but not dorsal 
hippocampus (dHPC)) activity in the theta range, and theta syn- 
chrony was unaffected by terminal inhibition (Extended Data Fig. 7). 

vHPC-mPFC gamma synchrony correlated with behaviour in two 
key ways. PPC values were higher during sample than choice runs, 
demonstrating that stronger gamma phase-locking is associated with 
the encoding phase of the trial (Fig. 5i). Additionally, phase-locking 
was stronger during sample runs of correct trials than of incorrect 
trials (Fig. 5j), suggesting that it may support effective encoding of 
location cues. 


Discussion 


We leveraged a temporally precise, projection-specific manipulation 
to test the role of the VHPC-mPFC afferent pathway in spatial 
working memory. We found that direct VHPC-mPFC input is essen- 
tial for successful encoding of task-related cues, both behaviourally 
and at the level of neural representation within the mPFC. VHPC- 
mPFC gamma synchrony correlated with successful cue encoding and 
was also disrupted by vHPC terminal inhibition. These findings point 
to a role for the VHPC-mPFC afferent pathway as a conduit for the 
updating of task-critical location cues. 

The finding of a dependence of task performance on vVHPC input 
only during sample runs, replicated across both behavioural para- 
digms used in this study, provides a strong argument for the import- 
ance of VHPC-mPFC afferent input during the trial phase in which 
relevant spatial cues are encoded. The role of VHPC-mPFC input in 
encoding goal location is further supported by the effect of terminal 
inhibition on the neural representation of goal location in the mPFC. 
Moreover, while contemporaneous location was robustly represented 
as a goal-selective enhancement of firing in preferred goals during 
both sample and choice runs, this representation depended on VHPC 
input only during the sample run. Here the physiology agrees with the 
behavioural observation that VHPC-mPFC input is critical for encod- 
ing, not retrieval, of task-relevant location cues. 

Interestingly, we find no evidence of retrospective location coding 
in mPFC activity (Fig. 3 and Extended Data Fig. 3). Within the para- 
meters of this study, this argues against persistent firing within mPFC 
neurons as a means of maintaining the stimulus representation 
between encoding and retrieval. Previously, retrospective and pro- 
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Figure 5 | Task-dependent modulation of mPFC spiking by VHPC gamma. 
a, Example raw and gamma-filtered VHPC LFP. b, Distribution of phase- 
locking values for all units from spikes recorded at all times, coloured by 
significance (Rayleigh’s test, P< 0.05). Insets, VHPC gamma phase histograms 
from example mPFC units (Cell 51: z = —3.24, P < 0.001, PPC = 0.0003; Cell 
338: z << —6, P< 0.0001, PPC = 0.002; Cell 324: z = —2.8, P = 0.002, PPC = 
0.001). ¢, Percentage of mPFC units significantly phase-locked to VHPC gamma 
across a range of lags. Dashed line, chance. d, Pseudocolour plot of normalized 
PPC values, sorted by lag of maximal phase-locking, for mPFC units with 
Bonferroni-corrected significance (P < 0.0029). e, Mean normalized PPC value 


spective location representations have been seen in mPFC cells of 
rodents during delays in spatial alternation tasks in which past and 
future location are intertwined”’”*, while in primates there is ample 
evidence for stimulus-specific delay-period firing in the dorsolateral 
PFC’. The primate dorsolateral PFC is functionally analogous to, 
but anatomically distinct from, the rodent prelimbic region of the 
mPFC**’. But retrospective place firing has been shown to be absent 
in the rodent mPFC when past and future location are independent”. 
While the delays used here were relatively brief (10-20 s), multiple 
previous studies have demonstrated a requirement for the mPFC at 
similar delays'*??*>. 

The combined results that VHPC-mPFC input is necessary only 
during cue encoding, that the mPFC lacks retrospective location rep- 
resentation during maintenance and retrieval, and that the mPFC 
representation of contemporaneous location during retrieval is not 
dependent on direct vVHPC input, suggest that the mPFC processes 
sample goal location transiently and that some downstream struc- 
ture(s) may maintain the information thereafter. Likely candidate 
structures would include the dHPC and the thalamic nucleus 
reuniens. We cannot, however, rule out the possibility that retrospect- 
ive location is persistently represented in the mPFC in a form not 
detectable using the analytic techniques presented here, such as in 
temporary changes in synaptic weights, or transient reactivation that 
is too brief to be reliably decoded from binned spike histograms. 

We find that VHPC-mPFC gamma synchrony is correlated with 
successful location encoding, and that inhibition of this input disrupts 
long-range gamma-frequency but not theta-frequency synchrony. 
vHPC gamma oscillations entrain local output in a phase-coherent 
manner (Extended Data Fig. 6), which influences mPFC spiking at 
putatively monosynaptic delays. These findings demonstrate that 
entrainment of mPFC spikes to VHPC gamma oscillations is a physio- 
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by lag. f, Distribution of lags at peak phase-locking strength; shifted towards a 
vHPC lead (n = 43 units, sign rank, z = —2.2, P = 0.014). Asterisk indicates 
mean lag. g, Distribution of gamma phases for spikes from an example mPFC 
unit from an Arch* animal during all light off runs (Rayleigh’s P = 0.03) and 
light on runs (Rayleigh’s P = 0.3). h-j, Change in phase-locking comparing 
light on versus off (1 = 140 units from 7 Arch* mice, z= —3.9,P=8.7X10 > 
and n = 222 units from 6 Arch’ mice, z = —1.83, P = 0.07) (h); choice versus 
sample phases (n = 458 units, z = —3.2, P = 0.0016) (i); and correct versus 
incorrect trials (n = 270 units, z = —4.2, P = 3.5 X 10”) (j). Significance by 
sign rank. Error bars shows s.e.m. 


logical signature of task-critical long-range signal propagation. This 
phenomenon is subtle—at the lag with the greatest phase-locking, 
fewer than 10% of mPFC units were significantly phase-locked to 
vHPC gamma. Nevertheless, the observed correlation between 
vHPC-mPFC gamma synchrony and successful encoding suggests 
that gamma synchrony could be a behaviourally relevant marker of 
effective long-range functional connectivity”. 

The finding that mPFC theta activity leads VHPC theta, and that 
theta synchrony between the two structures did not appear to depend 
upon vVHPC-mPFC afferents, ran counter to our initial hypotheses. 
Previous work had shown that dHPC theta leads mPFC theta!®”°, 
theta waves travel from the dHPC to vHPC*””’, and silencing of 
vHPC activity affects dHPC-mPFC theta synchrony”’, supporting 
the idea that the vHPC theta-patterned activity might directly entrain 
mPFC theta. The current findings regarding theta synchrony suggest 
that there may be an alternative explanation for the previous finding 
of reduced dHPC-mPFC synchrony with vHPC silencing. It is pos- 
sible that pharmacological inactivation of VHPC affected downstream 
targets in the dHPC, disrupting dHPC theta activity and thus dHPC- 
mPFC theta synchrony. This possibility is supported by the finding 
that vVHPC silencing with muscimol reduces dHPC theta power”. 

Our findings point to a role for the VHPC-mPFC afferent pathway 
as a conduit for the updating of task-critical location cues, extending 
previous work that implicates the VHPC and mPFC in contextual 
learning to short timescales*”’. Future work in this area should seek 
to implicate intermediary and/or upstream structures in the transmis- 
sion of theta-patterned activity between the HPC and mPFC, to deter- 
mine what (if any) role the mPFC has in maintenance and/or retrieval 
of task-related cues, and to identify which of the other major inputs 
to the mPFC serve to mediate the performance of spatial working 
memory. 
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METHODS 


Subjects. Male C57BL/6 mice (Jackson Labs) were used for all experiments, aged 
8-12 weeks at first use. Mice were housed in a New York State Psychiatric 
Institute satellite facility and were maintained on a 12-h light-dark cycle. 
Except when food-restricted for the purpose of behavioural training and testing, 
all mice were given ad libitum access to food and water. Pre-surgical mice were 
group-housed with littermates, while mice with chronic recording implants were 
singly housed in divided cages with visual, auditory and olfactory contact with 
another implanted mouse. Six mice were used in the acute anaesthetized experi- 
ment (3 Arch*, 3 Arch’), 14 in the two-goal T-maze experiment (8 Arch”, 6 
Arch’), and 13 in the four-goal T-maze experiment (7 Arch*, 6 Arch’). The effect 
size for behavioural impairment in the Arch* group in the four-goal experiment 
was 1.48, and the probability of observing a significant effect (statistical power) 
was 90% for an N of 7, 83% for an N of 6 (Arch* and Arch™ group sizes, 
respectively). Mice were randomized to a given viral type. All procedures were 
approved by Columbia University and the New York State Psychiatric Institute 
Institutional Animcal Care and Use Committees. 

Surgical preparation. Animals were placed inside a flow box and anaesthetized 
with isoflurane gas (2%) until sedated, at which point they were placed in a 
stereotax and maintained on 0.5% isoflurane for the duration of the surgery. 
Craniotomies were made bilaterally above the mPFC, dHPC and vHPC (coordi- 
nates below), and skull screws placed over the cerebellum and olfactory bulb 
served as ground and reference, respectively. In the acute anaesthetized experi- 
ment, surgical depth was maintained with isoflurane for the duration of the 
experiment. 

Viral transduction. AAV2/5 of titre exceeding 10’* vg ml (K. Deisseroth via 
UNC Vector Core and UPenn Vector Core) was used to package the virus. In 
the acute stimulation/silencing experiment and the two-goal T-maze experiment, 
a CamKIIa-eArch3.0-eYFP sequence was used to express the opsin and 
CamKIIa-mCherry was used as an opsin-negative control. For the four-goal 
experiment, hSyn-eArch-eYFP and hSyn-eYFP were used for opsin and control, 
respectively. The hSynapsin promoter was chosen for the four-goal experiment to 
account for possible long-range GABAergic VHPC-mPFC projections and to 
avoid potential toxicity effects resulting from opsin expression under the stronger 
CamKIIa promoter. Virus was targeted to multiple targets within the stratum 
pyramidal of ventral CA1 (two mediolateral rows at anterioposterior (AP) 2.95 
and 3.25, with sites at mediolateral (ML)/dorsoventral (DV): 2.65/4.5, 3.0/4.3, 
3.35/3.9, 3.7/3.3-2.9. An additional row was made at AP 3.1, with ML/DV sites at 
2.8/1.55 and 3.15/1.7. All coordinates are reported in mm, all AP and ML coor- 
dinates are with respect to bregma, DV coordinates with respect to brain surface. 
A 200 nl bolus was delivered to each site via glass micropipette (20-40 14M 
diameter) at a rate of 100 nl min! continuous infusion, with a wait time of 
5 min between infusion and retraction. 

Electrode and fibre implantation. LFPs were recorded using 50-j1M-diameter 
tungsten wire, while spikes and LFPs were recorded using stereotrodes (mPFC) 
and tetrodes (vVHPC) made from 13-,1M-diameter tungsten fine wire. For the two- 
goal T-maze experiment, stereotrodes were coupled to ferrule-bound optical 
fibres (Thorlabs, 200-j1M-diameter core, 0.39 NA) positioned 300-500 11M dorsal 
to the stereotrode tips, which were arrayed semi-circularly around the lateral edge 
of the fibre. Fibre-coupled stereotrode bundles were then implanted bilaterally in 
the mPFC (1.8 mm anterior, 0.4 mm lateral, 1.4 mm ventral), while LFP wires 
were implanted bilaterally in the dHPC (1.85 mm posterior, 1.25 mm lateral, 1.45 
mm ventral) and the vHPC (3.1 mm posterior, 3.0 mm lateral, 3.9 mm ventral). 
For the four-goal experiment, 9 of the 13 animals were implanted as described 
earlier, while 4 animals (2 Arch* and 2 Arch’) were instead implanted with 
tetrodes over the VHPC (3.7 mm ventral), which were advanced until spikes with 
putative pyramidal-cell waveforms were detected (Extended Data Fig. 4). mPFC 
optical fibres in these 4 animals were coupled to stationary bilateral LFP wires. All 
movable microdrives were advanced at a rate of 40 1M per day across all record- 
ing days. Recording sites were histologically confirmed by visual examination of 
electrothermolytic lesions made before killing and perfusing implanted animals. 
Lesions were induced by passing current through an electrode at each implanted 
site (50 A, 20 s). Perfused and fixed tissue was then sectioned, and DNA was 
stained using 4’ ,6-diamidino-2-phenylindole (DAPI) Fluoromount-G mounting 
medium (Southern Biotech). VHPC LFP wires used for phase-locking analysis 
were located in the stratum pyramidale, stratum radiatum or stratum lacunosum- 
moleculare of ventral Cal; no differences in gamma power, peak frequency or 
synchrony with mPFC units were seen for the different laminar locations. 
Recording and spike sorting. Recordings were amplified, band-pass filtered (1- 
1,000 Hz LFPs, 600-6,000 Hz spikes), and digitized using the Neuralynx Digital 
Lynx system. LFPs were collected at a rate of 2 kHz, while spikes were detected by 
online thresholding and collected at 32 kHz. Units were initially clustered using 
Klustakwik (Ken Harris), sorted according to the first two principal components, 
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voltage peak and energy from each channel. Clusters were then accepted, merged 
or eliminated based on visual inspection of feature segregation, waveform dis- 
tinctiveness and uniformity, stability across recording session, and inter-spike 
interval distribution. 

Behavioural training. Mice undergoing behavioural testing were given 5 weeks 
to recover from surgery and allow for expression and transport of the opsins, at 
which time they were placed on a food-restricted diet consisting of 1.5-2.5 g of 
food per day as needed to maintain 85% of their post-recovery bodyweight. The 
researcher responsible for training and running the mice was blinded as to viral 
group by another laboratory member, who randomly assigned opsin and control 
viruses. Mice were then given 2 days of habituation to the maze, which consisted 
of 10 min free exploration and foraging (plugged into optical fibres and recording 
tether) with all doors open, followed by 5 min of interleaved laser light pulses in 
the start box with doors closed (5-10 s on, 30 s off, 5 min). On the subsequent 2 
days mice underwent behavioural shaping consisting of 10 min of running to 
baited goal arms in alternating directions. Mice then underwent training on the 
T-maze task until criterion performance (consisting of 70% correct trials on 2 out 
of 3 consecutive days) was achieved, in the absence of illumination. All animals 
that underwent successful surgeries met criterion. Inter-trial delay was 20 s. On 
the two-goal experiment, reward consisted of dustless pellets (Bio-Serv). Also in 
the two-goal experiment only, to minimize across-session performance drift and 
ceiling effects, intra-trial delay was adjusted according to the preceding day’s 
performance, beginning at 10 s and progressively increasing delay by an addi- 
tional 5 s following days on which performance was above 80% (as in ref. 12). For 
the four-goal experiment, reward consisted of sweetened condensed milk 
(~50 pl, 3:1 dilution). To allow for direct comparison of delay within animals 
in this task, intra-trial delay was fixed at 10 s, with the addition ofa single session 
of 20 s delays as the final session for each animal. Light stimulation was 532 nm, 
10 mW in all cases. Task performance was lower when sample and choice goals 
matched in the left/right dimension than when they differed (n = 13 mice; mean 
= 0.65 + 0.01 and 0.71 + 0.02, respectively; t = —3.91, P = 0.001), but per- 
formance on these trials remained above chance (t = 4.0 X 10°, P= 3.1 X 10-*°). 
Statistics. All effects presented as statistically significant exceeded an «-threshold 
of 0.05. All independence tests were two-tailed. All independence testing of paired 
values (that is, changes across conditions) used paired t-tests or (where stated) 
signed rank tests. All ANOVA tests involving multiple observations per subject/ 
unit (every figure except Extended Data Fig. 2d) were done as repeated measures 
group-by-condition tests. All t-tests and rank tests performed with more than two 
groups were done post-hoc to ANOVA tests except where Bonferroni correction 
for multiple comparisons is specifically cited. 

Spike analysis using a linear classifier. A maximum margin linear classifier was 
used to decode sample goal location from binned population firing rate vectors 
and to quantify the strength and reliability of the neural representation. This type 
of classifier had the advantage of integrating comparisons between each pair of 
goal locations into a single inference, thus capturing all possible schemes of 
location discrimination in a way that can be applied across a diverse neural 
population. Matlab scripts are available for sharing upon request. 

Analyses were performed on all units for which there was at least one training 
and one testing trial per condition. 

Analysis using the population decoder was performed on binned spike vectors 
(500 ms bins, 100 ms increments) of all recorded units from sessions with at least 
two trials for each feature class under consideration. As in previous work with 
spike data from the PFC, we therefore took advantage of the accumulated data to 
decode task-relevant features by treating all units as a single, pseudosimulta- 
neously recorded population”. This approach assumes equivalent spike statistics 
across animals and recording sessions, and ignores potential contributions to 
population coding from correlated spike variability that would be observed if 
all units were indeed simultaneously recorded™. 

Model training was performed using constrained quadratic programming 
that employed a maximal margin perceptron***. The training samples to this 
algorithm are generated by averaging the recorded spike counts within the rel- 
evant conditions that need to be decoded, and across a specified training set of 
trials. This procedure gives a mean activity vector per condition, in which each 
component of the vector represents the trial-averaged activity of a given neuron at 
a given condition. The quadratic programing procedure then aims at finding a set 
of readout weights that maximally separate the mean activity vectors correspond- 
ing to the conditions that have to be discriminated. Multi-class discrimination 
problems are reduced to a set of binary discrimination problems involving all 
pairwise combinations of conditions”. 

The model was then tested by cross-validating its performance on a test set of 
recorded trials. Given a task condition, a test vector is generated by sampling its 
components from the distribution of spike counts recorded from the correspond- 
ing neuron during the test trials. At each test phase 100 test vectors per condition 


45,46 
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are randomly resampled with replacement, and the performance of the model is 
quantified as the average accuracy in classifying them. For each time bin, model 
training and testing was performed 100 times (at which point estimates of model 
accuracy approached asymptote) on non-overlapping subsets of trials (half of 
trials to train, half to test, random subsampling without replacement), with sub- 
sets constrained to include at least one trial corresponding with each feature class 
under consideration. For training and testing across separate trial conditions 
(Fig. 4), the same trial number requirement was applied across training and 
testing sets to ensure equal population sizes and equal representation of feature 
classes in the two trial sets. 

For identification of each unit’s preferred goal, absolute values of model 
weights for each goal’s three binary classifications (comparison with each other 
goal) were summed, and the goal with the highest summed value was judged to be 
the preferred goal. To segregate units with high and low goal selectivity, the 
absolute model weights for all six binary classifications (each goal-goal compar- 
ison) were summed; units in the upper and lower quintiles were judged to have 
high and low goal selectivity, respectively. 

Phase-locking analysis. Phase-locking of spikes to the oscillatory phase of LFPs 
was performed using pairwise phase consistency, which, unlike other commonly 
used measures of phase-locking, is unbiased by spike number’®. Nevertheless, to 
ensure a representative estimate of spike phase, we set a threshold of 100 spikes for 


all analysis; in comparisons of phase-locking across conditions (Fig. 5h-j and 
Extended Data Fig. 7h), only units that fired 100 spikes in each condition were 
included. LFP signal was digitally band-pass filtered (4-12 Hz for theta, 30-70 Hz 
for gamma) using a zero-phase-delay filter (filter provided by K. Harris and G. 
Buzsaki, order = sample frequency). The phase component was calculated by a 
Hilbert transform, and a corresponding phase was assigned to each spike. 
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Extended Data Figure 1 | Individual mPFC units clustered from fibre-coupled stereotrodes. a, Multiple individual units clustered from stereotrode recordings 


in the mPFC in the absence and presence of illumination. b, Mean waveforms of extracellular potentials from example units in a. 
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Extended Data Figure 2 | mPFC cells encode goal location both 


categorically and globally. a, A raster plot of spikes fired by an example single 


unit across trials, sorted by sample goal, temporally aligned to arrival at 
sample goal. b, Traces of firing rates averaged across trials by sample goal 
location, for the unit from a. This unit shows location selectivity, firing 
preferentially in the back left goal. Traces are mean + s.e.m. c, Spatial map 
of firing rates for the same unit for the full recording session. Goal-selective 


units tended to fire more at the preferred goal than at the other goals, and more 
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at all goals than in the rest of the environment. d, Percentage of units that 
were goal-selective as a function of time from sample goal, according to 
two-way repeated measures ANOVAs performed on binned spike rates. 
Units were identified as having selectivity for left/right (blue), back/front (red), 
and/or combined spatial dimensions (green). Dashed line represents chance 
(P = 0.05). Inset, percentage of units having each type and/or combination of 
selectivity at time zero (arrival at sample goal). Percentages are out of 792 
recorded units. 
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Extended Data Figure 3 | mPFC units represent choice goal location, not 
sample goal location, during choice runs. a, Model accuracy at the time bin 
corresponding with arrival at the sample goal port during the four-goal task was 
highest for spike histograms with time bins of 500 ms and 1,000 ms. Five- 
hundred-millisecond time bins were used for spike analyses. b, Decoding 
sample goal location during subsequent choice run during the four-goal task. 
Using the linear decoder, previously visited location was not detectable above 
chance accuracy. Ten- and twenty-second delay trials were combined. 

c, Decoding choice goal during choice run, correct versus incorrect trials during 
the four-goal task. Location decoded for this analysis was chosen goal (that is, 
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the mouse’s current location) rather than correct goal. Model accuracy reached 
0.93 upon arrival at the goal on correct trials. On incorrect trials, model 
accuracy exceeded chance during goal approach but dropped to chance levels 
upon reaching the goal. Ten- and twenty-second delay trials were combined. 
d, Decoding choice accuracy (correct versus incorrect) during choice trials. 
Model accuracy peaked at 0.99 at 1.9 s after arrival at the goal. b-d, Histograms 
were aligned to departure from start box. Ten- and twenty-second delay trials 
were combined. Data show mean + 95% confidence intervals for b and d; s.e.m. 
for c ande. 
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Extended Data Figure 4 | VHPC-mPFC terminal inhibition does not 
alter mPFC spike rate. a, Waveform features used to separate putative 
cell types. Spike duration was defined as the peak-to-trough time, while 
afterhyperpolarization (AHP) energy was taken as the area over the curve after 
the second zero-crossing. Spike duration yielded the clearest separation. 
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b, Putative fast-spiking (FS) and non-FS cells, sorted by spike width, showed 
no effect of terminal illumination on spike rate (Arch” non-FS: sign rank 

z= —1.7, P = 0.095; Arch” FS: z = —1.6, P = 0.11; Arch* non-FS: z = —2.7, 
P = 0.79; Arch” FS: z = —0.49, P = 0.62). 
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Extended Data Figure 5 | Effect of mPFC illumination on goal-selective 
firing in the mPFC. a, Low-weighted units, as identified using the classifier, 
show no difference in firing between the goal with the highest weight relative to 
the other goals. In the sample goal these units fire at rates not different than 
their session mean rates. Traces indicate mean + s.e.m. of normalized firing 
rate (bin FR — session FR). b, Terminal inhibition eliminates firing rate 
differences in preferred (Pref.) versus non-preferred (Other) goal during 


encoding across all units. On sample runs with no light, units from both Arch” 
(bottom left) and Arch™ animals (top left) had elevated firing rates in preferred 
goal relative to non-preferred goal (red asterisks mark time points with 
Bonferroni-corrected significance). In Sample Light runs, units from Arch™ 
animals maintain elevated firing in the preferred goal (bottom right), while 
units from Arch* animals show no significant firing rate difference (top right; 
N = 358 Arch” units, 325 Arch* units, sign rank P < 0.0005). 
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Extended Data Figure 6 | VHPC gamma modulates vHPC output. a, vHPC —_ Units maximally phase-locked at a lag of zero, with no net difference from zero 


units phase-lock maximally to the VHPC gamma rhythm at a lag of zero across the population. e, VHPC units share a common preferred gamma phase. 
(P value from Rayleigh’s test < 0.05, dashed line indicates chance rate). Pooled spikes from significantly phase-locked vVHPC units were modulated by 
b, Normalized PPC values, sorted by lag of maximal phase-locking, for vHPC gamma phase at zero-lag (N = 26,303 spikes, Rayleigh’s z = 17.6, P = 2.2 
significantly phase-locked vVHPC units. Units with Bonferroni-corrected x 10°, PPC value = 0.002), with peak spiking in the descending phase of the 
significance within the —40 to 40 ms lag window (Rayleigh test, P< 0.0029) | gamma cycle. (Note that spikes and LFPs were both recorded from stereotrodes 
were included. c, Mean normalized PPC value for the population shown in in the stratum pyramidale and that this gamma phase would probably differ 


b. Shading is s.e.m. d, Histogram of units with maximum PPC value at each lag. _ from that recorded in SLM, as in Fig. 5). 
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Extended Data Figure 7 | mPFC theta activity follows dHPC and leads 
vHPC during the task. a, Example VHPC LFP (blue, right) and spectrogram 
(left) demonstrating robust theta (grey, 4-12 Hz) and gamma (red, 30-70 Hz) 
components during all runs towards goals. b, Pseudocolour plot of relative 
strength of mPFC unit phase-locking to VHPC theta at lags from —200 ms to 
200 ms, for units with Bonferroni-corrected significance in at least one lag. 
Warmer colours indicate stronger phase-locking. c, Distribution of lags at peak 
phase-locking strength for significantly phase-locked mPFC units. Distribution 
centred at 0 (N = 189 units, z = 2.05, P = 0.98). d, Mean + s.e.m. PPC value 
of mPFC units and vHPC theta, as a function of lag. e-g, Phase-locking of 
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mPFC units to dHPC theta as a function of lag, as in b—d. Distribution of lags 
at peak phase-locking is significantly shifted towards a dHPC lead (N = 160 
units, sign rank z= —4.4,P = 6 X 10°). h, No difference in strength of phase- 
locking of mPFC units to VHPC (left) and dHPC (right) theta in light on 
versus light off trials. Mean and s.e.m. shown for each (N = 140 units, sign 
rank z 1.3, P = 0.2; z 1.4, P = 0.12). i-k, Phase-locking of VHPC 
units to mPFC theta as a function of lag, as in b-d. Distribution of lags at 
peak phase-locking is significantly shifted towards an mPFC lead (N = 51 units, 
z= -—5.03,P=2.4X 10 ”). 
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A novel multiple-stage antimalarial agent 
that inhibits protein synthesis 
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Simon Campbell!, Alan H. Fairlamb’, Paul A. Willis'’, Julian C. Rayner*, David A. Fidock”"®, Kevin D. Read! & Ian H. Gilbert! 


There is an urgent need for new drugs to treat malaria, with broad therapeutic potential and novel modes of action, to 
widen the scope of treatment and to overcome emerging drug resistance. Here we describe the discovery of DDD107498, 
a compound with a potent and novel spectrum of antimalarial activity against multiple life-cycle stages of the 
Plasmodium parasite, with good pharmacokinetic properties and an acceptable safety profile. DDD107498 demon- 
strates potential to address a variety of clinical needs, including single-dose treatment, transmission blocking and 
chemoprotection. DDD107498 was developed from a screening programme against blood-stage malaria parasites; its 
molecular target has been identified as translation elongation factor 2 (eEF2), which is responsible for the 
GTP-dependent translocation of the ribosome along messenger RNA, and is essential for protein synthesis. This 


discovery of eEF2 as a viable antimalarial drug target opens up new possibilities for drug discovery. 


The World Health Organization estimates that there were approxi- 
mately 200 million clinical cases and 584,000 deaths from malaria in 
2013, predominantly among children and pregnant women in sub- 
Saharan Africa’. The malaria parasite has developed resistance to 
many of the current drugs, including emerging resistance to the core 
artemisinin component of artemisinin-based combination therapies 
that constitute current first-line therapies’. To support the current 
treatment and eradication agenda® , there are several requirements for 
new antimalarials: novel modes of action with no cross-resistance to 
current drugs; single-dose cures; activity against both the asexual 
blood stages that cause disease and the gametocytes responsible for 
transmission; compounds that prevent infection (chemoprotective 
agents); and compounds that clear Plasmodium vivax hypnozoites 
from the liver (anti-relapse agents)*. 


Discovery of a novel antimalarial 


A phenotypic screen of the Dundee protein kinase scaffold library” 
(then 4,731 compounds) was performed against the blood stage of the 
multi-drug-sensitive Plasmodium falciparum 3D7 strain. A com- 
pound series from this screen, based on a 2,6-disubstituted quin- 
oline-4-carboxamide scaffold, had sub-micromolar potency against 


the parasites, but suffered from poor physicochemical properties. 
Chemical optimization (Fig. 1 and Extended Data Fig. 1) led to 
DDD107498 with improved physicochemical properties (Tables 1 
and 2 in Supplementary Methods) and a 100-fold increase in 
potency. The key stages involved were replacing the bromine with a 
fluorine atom to reduce molecular mass and lipophilicity; replacing 
the 3-pyridyl substituent with an ethyl-pyrrolidine group; and addi- 
tion of a morpholine group via a methylene spacer. Initial cost-of- 
goods estimates and likely human dose projections suggest a low cost 
(approximately US$1 per treatment), which is important, given that 
most of the patient population is living in poverty. 


Blood-stage activity and developability 


DDD107498 showed excellent activity against 3D7 parasites: 50% 
effective inhibitory concentration (ECs9) = 1.0 nM (95% confidence 
interval (CI) 0.8-1.2 nM); ECog = 2.4 nM (95% CI 2.0-2.9 nM); 
ECg9 = 5.9 nM (95% CI 4.5-7.6 nM), (n = 39). It was also almost 
equally active against several drug-resistant strains (Extended Data 
Fig. 2a)°. Furthermore, DDD107498 was more potent than artesunate 
in ex vivo assays against a range of clinical isolates of both P. falci- 
parum (median ECs, = 0.81 nM (range 0.29-3.29 nM), n = 44) and 
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Hit from screen 


DDD102542 


P{(3D7) ECs 0.12 uM 
Calculated log(P) 4.3 
Solubility not determined 
Mouse cli. 5.3 ml min“? gt 


Pf(3D7) EC,, 0.35 uM 
Calculated log(P) 3.7 
Solubility 36 uM 

Mouse cli. 8.6 ml min“? gt 


DDD103679 DDD107498 
Pf(3D7) ECs 0.70 uM 
Calculated log(P) 3.7 
Solubility 180 uM 

Mouse cli. 3.4 ml min“! gt 


Pf(8D7) EC 0.001 uM 
Calculated log(P) 3.2 
Solubility 216 uM 

Mouse cli. < 0.9 ml min-' g™ 


Figure 1 | Chemical evolution of DDD107498 from the phenotypic hit. Calculated log(P), calculated log(partition coefficient); Solubility, solubility in water; 


Mouse cli., intrinsic clearance in mouse liver microsomes. 


P. vivax (median ECs) = 0.51 nM (range 0.25-1.39 nM), n = 28) 
collected from patients with malaria from southern Papua, Indonesia, 
a region where high-grade multidrug-resistant malaria is endemic for 
both species (Extended Data Fig. 2b)”*. In contrast, the compound was 
not toxic to human cells (MRC5 and Hep-G2 cells) at much higher 
concentrations (>20,000-fold selectivity; Extended Data Fig. 2c). 

DDD107498 showed good drug-like properties: metabolic stability 
when incubated with hepatic microsomes or hepatocytes from several 
species; good solubility in a range of different media; and low protein 
binding (Tables 1 and 2 in Supplementary Methods). DDD107498 
displayed excellent pharmacokinetic properties in preclinical species, 
including good oral bioavailability (an important prerequisite for 
use in resource-poor settings) and long plasma half-life (important 
for single-dose treatment and chemoprotection) (Extended Data 
Table la). 

DDD107498 was very active in several mouse models of malaria, 
with comparable or greater efficacy than current antimalarials 
(Extended Data Table 1b). DDD107498 had a 90% reduction in 
parasitaemia (EDg9) of 0.57 mg per kg (body weight) after a single 
oral dose in mice infected with the rodent parasite Plasmodium 
berghei. Efficacy was also tested in NOD-scid IL-2R_null mice 
engrafted with human erythrocytes and infected with P. falciparum 
strain 3D7°°”/“? (Fig. 2a)’. When orally dosed daily for 4 days, the 
EDgo on day 7 after infection was 0.95 mg per kg per day. Blood 
sampling from the infected SCID (severe combined immunodefi- 
ciency) mice suggested a minimum parasiticidal concentration for 
DDD107498 of 10-13 ng ml ' for asexual blood-stage infections. 

The effects of DDD107498 on circulating parasites in the SCID 
mouse model could be observed in one replication cycle (48 h) and 
led to trophozoites with condensed cytoplasm (Extended Data Fig. 3). 
Stage specificity studies using synchronized cultures showed that, at a 
concentration of 4 nM for 24 h, DDD107498 led to (1) from the ring 
stage, formation of abnormal trophozoites; (2) from the trophozoite 
stage, prevention of schizont formation with a 50% reduction in para- 
sites, indicative of cidal activity; and (3) from the schizont stage, 
prevention of ring formation with a 98% reduction in parasites, indi- 
cative of cidal activity (Extended Data Fig. 3b, c). 

DDD107498 showed a similar parasite killing profile both in vitro 
(Fig. 2b, c) and in vivo (Fig. 2a), which is supportive of a common 
mode of action in cellular and animal models of disease. Using a 
parasite reduction rate assay’ there was a lag of about 24-48 h, during 
which time the effects of the compounds were reversible following 
wash-out. Rapid killing occurred after parasites had been exposed to 
DDD107498 for more than 48 h (Fig. 2b, c). 

All these experiments suggest for the blood-stage form that treat- 
ment with DDD107498 prevented development of trophozoites and 
schizonts and, at least in the case of schizonts, caused rapid killing. 
Any ring-stage parasites only developed as far as abnormal tropho- 
zoites, which appeared to survive for about 48 h under drug pressure, 
but were then killed. 

In safety studies, DDD107498 showed no clinically relevant inhibi- 
tion of any of the major human cytochrome P450 (CYP) isoforms and 
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Figure 2 | Efficacy studies and parasite killing rate. a, In vivo activity against 
P. falciparum in NOD-scid IL-2R_null mice. Six mice were used, each serially 
sampled. The variability of cytometry for repeated acquisitions of a sample 

is less than 2-3%. The percentage of parasitaemia was calculated by acquiring a 
minimum number of 500 parasitized erythrocytes. An independent experiment 
with three further mice was performed to confirm the EDgy. b, Determination 
of the in vitro killing rate of DDD107498. The in vitro parasite reduction rate 
assay was used to determine onset of action and rate of killing as previously 
described”. P. falciparum was exposed to DDD107498 at a concentration 
corresponding to 10 X ECs9. The number of viable parasites at each time point 
was determined as described’®. Four independent serial dilutions were done 
with each sample to correct for experimental variation; error bars, s.d. Previous 
results reported on standard antimalarials tested at 10 X ECso using the same 
conditions are shown for comparison”. c, The in vitro parasite reduction 
rate assay was used to determine the minimal concentration of compound 
needed for achieving maximal killing effects. Parasites were exposed to 
DDD107498 at concentrations of 0.1, 0.3, 1, 3 and 10 X ECso using conditions 
described above. Error bars, s.d. Concentrations of DDD107498 of 1 X ECs, are 
sufficient to produce maximal killing effects on treated parasites. 
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Figure 3 | In vitro activity of DDD107498 against P. berghei liver stages. Each experiment was the average of four technical replicates; h.p.i., hours 


post-infection. Bars, 95% CI. *More than one curve fitting possible. 


CYP450 induction risk was low (Table 3 in Supplementary Methods), 
indicating a low risk of clinical drug-drug interactions. DDD107498 
is non-mutagenic and has very weak inhibitory potencies on Ix; 
(hERG) and other ion channels, indicating a very low risk for adverse 
cardiovascular activity. Given its potency, long half-life observed in 
preclinical species, and safety margins from a rat 7-day toxicology 
study, DDD107498 has potential both for single-dose treatment and 
for once-weekly chemoprotection". 


Activity against other life-cycle stages 


Intra-hepatocytic parasites (liver schizont stages) are the first stage of 
human infection after injection of sporozoites by anopheline mosqui- 
toes. Compounds active against this stage have potential for use 
in chemoprotection. DDD107498 showed an ECso ~ 1 nM against 
the liver schizont forms of P. berghei and Plasmodium yoelii'’. 
DDD107498 was active when dosed for only 2 h during the initial 
infection (hepatocyte invasion) of the liver cells (Fig. 3). In contrast, 
atovaquone (clinically used for chemoprotection in combination 
with proguanil) had a much reduced activity during this period 
(ECs9 ~ 106 nM versus 0.3 nM for continuous treatment). Further, 
DDD107498 showed equivalent potency against the P. berghei liver 
stage when added after initial infection had been established (Fig. 3). 
This suggests that intermittent treatment may be sufficient for che- 
moprotection. To assess chemoprotective potential in vivo, mice were 
treated with DDD107498 2 h before being infected with luciferase- 
expressing P. berghei sporozoites (Extended Data Fig. 4). Ata dose of 3 
mg per kg, DDD107498 was fully curative with no sign of parasitae- 
mia after 30 days. Thus, DDD107498 demonstrates potent chemo- 
protection using in vitro and in vivo models, where blood sampling 
from the mice during the experiment suggests a minimum parasiti- 
cidal concentration of 15-20 ng ml. 

The parasite erythrocytic form differentiates into the asymptomatic 
male and female gametocytes (stages I-V) within the human host. 
Mature stage V gametocytes are infective to mosquitoes, but are 
not eliminated by the majority of current antimalarial agents, and 
remain circulating in the human bloodstream for up to 3 weeks, long 
after the disappearance of clinical symptoms of malaria’*’*. After 
ingestion by the mosquito, gametocytes differentiate into gametes. 
DDD107498 potently inhibited both male and female gamete forma- 
tion from the gametocyte stage at similar concentrations (1.8 nM 
(95% CI 1.6-2.1 nM) and 1.2 nM (95% CI 0.8-1.6 nM) respectively; 
Extended Data Fig. 5), indicating that it is an extremely potent inhib- 
itor of the functional viability of both male and female mature game- 
tocytes’’. In line with this, DDD107498 blocked transmission, as 
determined by the standard membrane feeding assay. In this assay, 
parasite cultures containing P. falciparum stage V gametocytes were 
exposed to compound for 24 h before mosquito feeding. DDD107498 
blocked subsequent oocyst development in the mosquito (measured 


after 7 days) with an ECs of 1.8 nM. Ata baseline oocyst intensity of 
27 oocysts per mosquito in the DMSO controls, prevalence of infec- 
tion was inhibited, with an ECso of 3.7 nM, as measured by the 
number of infected mosquitoes. Repeating the standard membrane 
feeding assay in which DDD107498 was added at the moment of 
mosquito feeding gave an ECs, of 10 nM, indicating potent activity 
against the parasite sexual stages that develop in the mosquito midgut 
(Extended Data Fig. 5)’*. 

A P. berghei mouse-to-mouse’”’* model was additionally used to 
examine transmission blockade. Mice were infected with P. berghei 
(PbGFPCONS507)”", and then treated orally with compound 24 h 
before mosquitoes took a direct blood meal’’. After a 3 mg per kg 
dose of DDD107498, a 90.7% reduction in infected mosquitoes and a 
98.8% reduction in oocysts per midgut was observed at day 10 com- 
pared with mosquitoes fed on untreated mice (Extended Data Fig. 6). 
A corresponding reduction in sporozoite intensity and prevalence was 
observed (93.8% and 88.6% respectively). Mosquitoes previously fed 
on infected, drug-treated mice were allowed to feed on uninfected 
mice’’. Testing a range of mosquito biting rates, we observed a mean 
reduction of 89.5% (95% CI 71.4-100) in the number of mice that 
developed blood-stage infection compared with mice bitten by mos- 
quitoes that had fed on non-drug-treated infected mice. The overall 
effectiveness of an intervention over a round of transmission (from 
mouse to mosquito to mouse) can be quantified by estimating its 
ability to reduce the basic reproductive number (Ro). This has been 
termed the ‘effect size’ of an intervention. By fitting data from the 
mouse-to-mouse assay to a chain binomial model we can estimate the 
effect size of the intervention”, assessing the ability of DDD107498 
usage to reduce the basic reproductive number Rp (assuming 100% 
coverage). Our results estimate an effect size of 90.5% (95% CI 
78.3-94.2), suggesting that DDD107498 is capable of acting as a 
potent transmission-blocking drug over multiple transmission set- 
tings within a field context'*. The combination of these key in vitro 
and in vivo assays demonstrates the very strong potential of 
DDD107498 for blocking transmission; importantly, the required 
doses are likely to be similar to those required for treatment of 
blood-stage malaria. 


17,18 


DDD107498 targets PfeEF2 


To determine the molecular target for DDD107498, asexual blood- 
stage P. falciparum were cultured in the presence of DDD107498 at 
5 X ECso, until parasites became resistant (Extended Data Table 2)”°. 
Resistance was obtained in the 3D7 (drug-sensitive) and 7G8 and Dd2 
(multi-drug-resistant) strains, with minimum inocula of 10’, 10’ and 
10° parasites respectively. Genomic DNA was extracted from resistant 
lines and whole-genome sequencing of ten drug-resistant lines iden- 
tified shared mutations in one gene, which were not present in the 
parental lines: Pf3D7_1451100 (Supplementary Data 1). This gene 
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Figure 4 | DDD107498 targets protein synthesis via eEF2. a, eEF2 promotes 
the GTP-dependent translocation of the ribosome along messenger RNA 
during protein synthesis. b, Homology model of P. falciparum eEF2. The 
mapped mutations from each strain are colour coded by ECsp fold (red, high; 
amber, moderate; green, low). c, Live-cell imaging of P. falciparum expressing 
an extra copy of eEF2 (WT) fused to GFP. The image is representative of more 
than 50 parasites visualized on two independent occasions. d, Protein and 
DNA/RNA synthesis were evaluated by measuring the incorporation of 
[°°S]Met/Cys (upper panel) and (?H] hypoxanthine (lower panel) into 
asynchronous 3D7 wild-type (open circles) and 3D7 DDD107498-resistant 
lines (CEF2-E134A/P754A) (filled circles) after incubation for 40 min with 


encodes P. falciparum translation elongation factor 2 (PfeEF2). Three 
lines had two single nucleotide polymorphisms in PfeEF2, with a 
mixture of wild-type (WT) and mutant reads at each position 
(Supplementary Data 2), suggesting that these lines were mixtures 
of two clones, each with independent mutations in PfeEF2. Single 
nucleotide polymorphisms were confirmed by Sanger sequencing 
and nine validated mutations in PfeEF2 clustered in three regions 
of the encoded protein. In two cases identical single nucleotide poly- 
morphisms were identified in two independent lines, indicating 
mutations in functionally important residues that were acquired inde- 
pendently in separate selection experiments. The fact that resistance 
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DDD107498, cycloheximide or actinomycin D. Radiolabelled incorporation, 
measured as counts per minute, was normalized as the percentage of 
incorporation against inhibitor concentration (means + s.d.; n = 3 indepen- 
dent experiments, each run in duplicate). e, The ECsp values for transfectants 
against DDD107498 (means ~ s.d.; n = 4-7 independent experiments, 

each run in duplicate). Statistical significance was determined by the Mann— 
Whitney U-test: *P < 0.05; **P < 0.01. f, DDD107498-resistant line 
(eEF2-Y186N) transfected episomally with plasmids expressing either WT 
eEF2 or eEF2-Y186N (means = s.d.; n = 3 independent experiments, each 
run in duplicate). 


to DDD107498 can be associated with multiple independent muta- 
tions is in keeping with observations from both artemisinin and other 
antimalarial compounds in development”*”. 

eEF2 is one of several essential elongation factors required in eukar- 
yotic protein synthesis, by mediating GIP-dependent translocation of 
the ribosome along messenger RNA (mRNA) (Fig. 4a)**. Yeast eEF2 is 
the target of the antifungal compound sordarin**”*. Sordarin is select- 
ive for the Saccharomyces cerevisiae eEF2 over mammalian eEF2 in 
vitro, demonstrating that although protein synthesis in eukaryotes is 
conserved, it is possible to obtain selective inhibitors, despite the 
relatively high homology (67.2% identity) between the yeast and 
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human eEF2 (Extended Data Fig. 7)”*”’. In keeping with the potential 
for selectivity, DDD107498 is not toxic to mammalian cells (Extended 
Data Fig. 2c). The PfeEF2 mutations associated with resistance 
mapped to several areas on the surface of a homology model of the 
protein (on the basis of the structure of S. cerevisiae eEF2 with no 
ligands bound”), with the mutations giving the highest degree of 
resistance clustering together (Fig. 4b). A DDD107498 binding pocket 
could not be elucidated through these modelling studies. 

We confirmed that brief pre-incubation with DDD107498 specif- 
ically inhibited P. falciparum protein synthesis, by measuring short- 
term incorporation of [?°S]-labelled methionine and cysteine 
([?°S]Met/Cys) in WT and DDD107498-resistant 3D7 P. falciparum 
parasites (Fig. 4d)’. Cycloheximide (a protein synthesis inhibitor) 
and DDD107498 both prevented [*°S]Met/Cys incorporation into 
WT P. falciparum 3D7, whereas actinomycin D (an inhibitor of 
RNA transcription) had minimal effect. Notably, DDD107498 was 
100-fold less effective at inhibiting protein synthesis in DDD107498- 
resistant 3D7 than WT parasites, whereas cycloheximide prevented 
[*°S]Met/Cys incorporation equally in resistant and WT lines. 
DDD107498 and cycloheximide had minimal effects on DNA/RNA 
biosynthesis both in sensitive and in drug-resistant parasites 
(measured by incorporation of [PH]-labelled hypoxanthine), dem- 
onstrating specificity. In contrast, actinomycin D caused a marked 
dose-dependent reduction in [*H] incorporation. 

To confirm that PfeEF2 is the target for DDD 107498, we integrated 
transgenes expressing either WT PfeEF2 or resistance-associated 
alleles of PfeEF2 (Y186N, observed in mutant Dd2; and P754S, 
observed in mutant 3D7) using attPxattB integrase-mediated recom- 
bination”. These transfectants also express endogenous PfeEF2. 
Imaging of green fluorescent protein (GFP)-fusions of PfeEF2 showed 
cytoplasmic localization (Fig. 4c), indicating that DDD107498 inhi- 
bits protein synthesis in the cytoplasm as opposed to the apicoplast”®, 
the site of action of tetracycline and azithromycin. 

Dose-response assays with DDD107498 showed a similar inhibi- 
tion profile between PfeEF2 transgene-expressing lines and the WT 
Dd2 strain, indicating that the endogenous WT PfeEF2 has a dom- 
inant effect in these experiments (Fig. 4e). This may be due to stable 
complex formation between the ribosome, WT PfeEF2 and 
DDD107498, resulting in ribosome stalling, which would explain 
why in mixed populations the WT PfeEF2 is dominant. For example 
in bacteria, fusidic acid binds to the complex between EF-G and the 
ribosome, preventing dissociation and blocking protein translation*’. 

To determine whether WT PfeEF2 was dominant in poisoning 
translation, we introduced episomal plasmids encoding WT or 
Y186N mutant PfeEF2 into the resistant PfeEF2 Y186N line 
(Fig. 4f). Plasmid-borne PfeEF2-Y186N had no effect on sensitivity 
to DDD107498 (ECs) ~ 3,100 nM), whereas WT PfeEF2 restored 
sensitivity (ECs) = 2 nM). This demonstrated a dominant effect of 
WT PfeEF2 on parasite susceptibility and confirmed that PfeEF2 is 
the primary molecular target of the compound. We note that the 
shallow slope observed for the WT PfeEF2 transfected line is probably 
a result of heterogeneous episomal plasmid copy number seen with 
episomally transformed parasite lines”. Structural studies will be 
required to define precisely how DDD107498 interacts with eEF2 
and the ribosome. 

Resistance has been reported for all clinical antimalarials, including 
artemisinins’. While the correlation between the rate of resistance 
generation in laboratory and clinical settings for antimalarials is not 
fully understood, it is important to evaluate the risk of all new anti- 
malarials both in preclinical and in clinical studies. In our studies, the 
minimum inoculum for generating resistance to DDD107498 is 
within acceptable limits”. Furthermore, selected resistant Dd2 lines 
revealed impaired growth rates in the absence of drug pressure com- 
pared with WT Dd2 (Extended Data Fig. 8); moreover the higher the 
resistance, the greater the fitness defect. Importantly genome sequences 
from 1,685 clinical isolates of P. falciparum from 17 countries” reveal 
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a high degree of PfeEF2 sequence conservation in the field. The sole 
non-synonymous single nucleotide polymorphism (T16S) identified 
was unique to West Africa (allele frequency of 0.002) and is in a 
PfeEF2 domain distinct from mutations associated with in vitro resist- 
ance to DDD107498. 


Conclusion 


DDD107498 represents a promising prospect for development as an 
antimalarial agent, with a potent activity profile against multiple life- 
cycle stages (sub-10 nM), a novel mode of action and excellent drug- 
like properties. It has potential for single-dose treatment, which 
has major implications for ensuring patient compliance and 
practical deployment. Its complementary activity on the sexual 
stages of the parasite has potential to reduce transmission and its 
action on the liver stage suggests a possible role in chemoprotection. 
Chemoprotection and transmission-blocking properties are fun- 
damental to the goal of eliminating and eradicating malaria, for which 
the high potency and long half-life of DDD107498 are well suited. 

Owing to general concerns about the emergence of resistance, all 
antimalarials are developed as combination therapies, a strategy 
shown to improve efficacy and reduce the development of drug res- 
istance. In terms of treatment of the erythrocytic stage, DDD107498 
fulfils the criteria as a long-duration partner to complete the clearance 
of blood-stage parasites’. Therefore, it should be combined with a 
fast-acting compound, ideally with a pharmacological duration of 
action as close to DDD107498 as possible. This would reduce the 
initial level of infection, with the prolonged activity of DDD107498 
eliminating the remaining parasites". 

Inhibition of protein synthesis by DDD107498 through PfeEF2, 
which is expressed in multiple life-cycle stages*, provides mechanistic 
support for the observed broad-spectrum profile. This highlights 
PfeEF2 as a novel drug target in malaria, and implies that inhibition 
of protein synthesis is an effective intervention for achieving multi- 
stage activity in Plasmodium. DDD107498 has now been progressed 
into advanced non-clinical development, with the aim of entering into 
human clinical trials. 
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Extended Data Figure 1 | Synthetic methodology. a, Synthesis of DDD107498 (4). b, Synthesis of DDD102542 (6a) and DDD103679 (6b). 
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Extended Data Figure 2 | In vitro activity. a, In vitro activity against a panel independent [*H] hypoxanthine incorporation experiments (each run in 

of resistant and sensitive strains of P. falciparum. CQ, chloroquine; PYR, duplicate). b, Ex vivo activity against P. falciparum and P. vivax clinical isolates 
pyrimethamine; CYC, cycloguanil; QUI, quinine; SUL, sulfadoxine; MFQ, from Papua (Indonesia). The lines on the scatter plots represent the median 
mefloquine. Resistance as follows: K1 (CQ, SUL, PYR, CYC); W2 (CQ, SUL, value for each drug. c, Effect of DDD107498 on P. falciparum 3D7, HepG2 and 
PYR, CYC); 7G8 (CQ, PYR, CYC); TM90C2A (CQ, PYR, MFQ, CYC); D6 MRCS cells. Data are the means + s.d. of n reported independent experiments. 
(MFQ); V1/S (CQ, SUL, PYR, CYC). Data are the means + s.d. of n = 3 
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Extended Data Figure 3 | Effect of DDD107498 on parasite morphology. 
a, Phenotype of P. falciparum in peripheral blood of NOD-scid IL-2R_null mice 
engrafted with human erythrocytes. Blood samples were taken at days 5 and 7 
of the assay (one and two asexual cycles, respectively) after the start of treatment 
with vehicle or DDD107498 at day 3. The bi-dimensional flow cytometry plots 
measure the murine (Ter-119-PE+) and human (Ter-119-PE-) erythrocytes, 
and the presence of nucleic acids (infected SYTO-16+ events). The blue circles 
indicate the region of infected erythrocytes. Vehicle-treated mice showed a 
characteristic pattern of staining with SYTO-16 (ref. 35), which correlated with 
the presence of healthy rings, trophozoites and schizonts in blood smears. 
Conversely, mice treated with DDD107498 at 50 mg per kg showed only 
trophozoites with condensed cytoplasm and some pyknotic cells at day 5 (red 
circle in flow cytometry plot and corresponding blood smears). By day 7, few 
infected erythrocytes were detected by flow cytometry and blood smears 
revealed parasites with a similar morphology to those at day 5. This suggests 
that trophozoites are the most sensitive population since the cycle is interrupted 


DDD107498 


24h untreated control 


28 
28 


ARTICLE 


DDD107498 


24h treated with DDD107498 


Oh 24h 
T Ss R bi Ss 
2.9 
3.3 
1.6 1.8 
16 0.9 
0.6 0.4 9.1 0.2 
0.6 0.4 0.1 0.1 


at this stage. The images displayed are taken from a mouse with high levels of 
parasitaemia. At least 50 parasites were counted per sample screened in the 
microscope. Of these, four photographs of representative parasite phenotypes 
were selected to represent the morphology of the most prevalent phenotype. 
Thus, this is a qualitative assessment. b, Stage specificity assays using 
synchronized cultures. For morphological analysis of antimalarial drug action, 
thin blood smears were prepared, fixed and stained with Giemsa followed by 
examination with an upright microscope using an oil-immersion lens (100). 
For parasitaemia determination, a total number of 1,000 red blood cells 
(corresponding to five microscopic fields) were counted. R to T, abnormal 
trophozoites observed after exposure for 24 h of synchronized rings to 
DDD107498. T to S, trophozoites do not develop into schizonts after exposure 
for 24 h to DDD107498. S to R, no ring stages are observed 24 h after treatment 
of schizonts with DDD107498. c, Percentage parasitaemia in the red blood cells. 
R, ring stage; T, trophozoite; S, schizont. 
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Extended Data Figure 4 | Prophylactic activity of DDD107498 against sporozoite challenge. P. berghei (luciferase) sporozoite in vivo mouse model of 
chemoprotection. A dose of 3 mg per kg was fully protective. Data are the mean of n = 5 experiments. 
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P. falciparum SMFA 


(standard membrane feeding 
assay) 


Extended Data Figure 5 | DDD107498 in vitro activity on the different vitro against P. berghei ookinetes'® (5.0 nM (95% CI 4.4-5.7 nM)), confirming 
life-cycle stages of Plasmodium spp. For comparator data with known that if DDD107498 was taken up during a blood meal, it could continue to kill 
antimalarials, see figure 5 in ref. 6. DDD107498 also showed potent activity in parasites within the mosquito. 
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Extended Data Figure 6 | In vivo P. berghei mouse-to-mouse assay. 

a, Impact of DDD107498 treatment on in vivo mosquito infection. Mice 
infected with P. berghei (PbGFPCONS07 clone 1)’’ were dosed orally with 
DDD107489 at 3 mg per kg, atovaquone at 0.3 mg per kg, sulphadiazine at 
8.4 mg per kg, or were not drug treated (negative control). After 24 h, popula- 
tions of treated mice (n = 5) were exposed to 500 female Anopheles stephensi 
mosquitoes, and oocyst intensity and infection prevalence in the mosquito 
midgut was measured 10 days after feeding. Individual data points represent the 
number of oocysts found in individual mosquitoes. Two replicates were 
performed. Sulphadiazine (8.4 mg per kg, intraperitoneally) and atovaquone 
(0.3 mg per kg, intraperitoneally) were used as negative and positive 
transmission-blocking drug controls, respectively. Horizontal bars represent 
mean intensity of infection, and error bars s.e.m., within individual mosquito 
populations. b, Impact of treatment with DDD107498 over a complete 
transmission cycle in vivo. After drug treatment of infected mice and mosquito 


feeds, surviving potentially infectious mosquitoes were allowed to blood-feed 
on naive mice at a range of transmission settings (biting rates of 2, 5 and 10 bites 
per naive mouse) to assess the ability of drug treatment to reduce the number 
of new malarial cases after mosquito bite. Efficacy is expressed as the following: 
(1) impact on the mosquito population—expressed as reduction in both 
oocyst and sporozoite intensity and prevalence; (2) impact on subsequent 
transmission to new naive vertebrate hosts—expressed as reduction in infection 
of naive mice (reduction in number of new malarial cases after mosquito bite) 
and inhibition of subsequent parasitaemia (day 10 after bite) in mice that do 
become infected; (3) effect size generated—by fitting the data achieved within 
this assay to a chain-binomial model we could assess the ability of DDD107498 
to reduce Ry (assuming 100% coverage). If Ro is reduced to <1, transmission 
is unsustainable and elimination will occur. Values of 95% CIs are shown in 
brackets. Efficacy is calculated in comparison with no-drug controls. TB, 
transmission blocking. 
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Extended Data Figure 7 | ClustalWS alignment of eEF2 sequences from human, yeast and Plasmodium (P. falciparum). Alignment made using Jalview 
ClustalX default colouring. 
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Extended Data Figure 8 | Fitness phenotypes of DDD107498-resistant 
parasite lines. Unmarked Dd2- and DDD107498-selected parasites with 
various levels of resistance were assessed for growth in a competition assay, 
relative to a Dd2-GFP reference line. a, Equal numbers of unmarked test lines 
were mixed with the Dd2-GFP reference, in triplicate wells, and the ratio of 
non-fluorescent and fluorescent cells assessed by flow cytometry over time. At 
day 0, all lines had a 1:1 ratio with the Dd2-GFP reference. Increased growth of 
the test line over the Dd2-GFP reference, which has a slower growth rate than 
unmarked WT Dd2, would result in an increased ratio of test: Dd2—GFP. 


b, Growth assay of four different test lines: (1) WT Dd2, (2) EF2-E134D, (3) 
EF2-L755F and (4) EF2-Y186N, relative to Dd2-GFP. A faster growth rate of 
WT Dd2 (DDD107498 ICs, 0.14 nM) relative to the fluorescent Dd2-GFP line 
is reflected in an increased ratio over time. The low-level resistant line EF2- 
E134D (ICso 5.8 nM) did not attain a WT growth rate, and the high-level 
resistant lines EF2-L755F (ICs, 660 nM) and EF2-Y186N (3,100 nM) were 
further impaired. Means + s.d.; n = 4 independent experiments, each run in 
triplicate. 
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Extended Data Table 1 | Pharmacokinetics and rodent efficacy of DDD107498 


a 
Mouse Rat Dog 
Intravenous 1 mg/kg 1 mg/kg 1 mg/kg 
Cl (ml/min/g) 12 18 30 
Vdss (L/kg) 15 15 30 
Ty, (h) 16 10 13 
Oral 3 mg/kg 5 mg/kg 3 mg/kg 
Cinex (N'G/ml) 90 180 27 
Tmax () 1 4 4 
Ty, (h) 19 18 20 
F% 74 84 46 
b 
30 mg/kg 10 mg/kg 3 mg/kg 
Compound Activity Survival = Activity = Survival ~— Activity = Survival 
% days % days % days 
Chloroquine’ 99.9 10 99.5 7 83 7 
Mefloquines 99.6 22 95 16 <40 7 
Artemether 98 rf 89 6 59 6 
Dihydroartemsinin 99 7 97 6 61 7 
Artesunate$§ 92 9 75 6 <40 6 
DDD107498 99.1 23 99.0 14 99 9 


a, In vivo pharmacokinetic parameters in preclinical species (n = 3, all species). b, Oral in vivo single-dose efficacy of antimalarial drugs and DDD107498 in the murine P. berghei model. Data are the means of n = 3 
for DDD107498 and n = 5 for the reference compounds. 8Data from ref. 36. 
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Extended Data Table 2 | Summary of resistance selection experiments with DDD107498 


Inoculum * MIR EC., shift 
< Pressure Pressure Number of positive cultures (day of recrudescence) (inoculum)t 
Strain (nM) (x ICeq) 
™ 10* 10° 108 107 108 10° 
8 
Dd2 1.25 27 ND ND 2 . : ND se 
(13, 41) (13,13,15) (13,13, 16) (29/29/29) 4810 (10°) 
3 7 7 
Dd2 1.25 27 0 0 0 (20,22,22) ND ND 10 13 (107) 
Dd2 1.25 27 ND 0 a ND ND ND 108 5567 (10°) 
7G8 1.25 3.1 ND ND ° : : : 107 i ee 
(25, 25, 47) (25) (15,15, 15) 82 (107) 
3D7 1.5 3.3 ND ND ° ; . : 107 : io 
: : (38, 38) (28,28) (20,20,20) 44(107) 


* Number of positive cultures (n = 3) and day of recrudescence for each inoculum. ECs fold change was determined on bulk cultures. MIR, minimum inoculum of resistance. 
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The mass of the Mars-sized exoplanet Kepler-138 b 


from transit timing 


Daniel Jontof-Hutter'?, Jason F. Rowe”, Jack J. Lissauer*, Daniel C. Fabrycky* & Eric B. Ford! 


Extrasolar planets that pass in front of their host star (transit) 
cause a temporary decrease in the apparent brightness of the star, 
providing a direct measure of the planet’s size and orbital period. 
In some systems with multiple transiting planets, the times of the 
transits are measurably affected by the gravitational interactions 
between neighbouring planets'”. In favourable cases, the depar- 
tures from Keplerian orbits (that is, unaffected by gravitational 
effects) implied by the observed transit times permit the planetary 
masses to be measured, which is key to determining their bulk 
densities’. Characterizing rocky planets is particularly difficult, 
because they are generally smaller and less massive than gaseous 
planets. Therefore, few exoplanets near the size of Earth have had 
their masses measured. Here we report the sizes and masses of three 
planets orbiting Kepler-138, a star much fainter and cooler than 
the Sun. We determine that the mass of the Mars-sized inner pla- 
net, Kepler-138 b, is 0.066+ 3-35 Earth masses. Its density is 2.674 
grams per cubic centimetre. The middle and outer planets are both 
slightly larger than Earth. The middle planet’s density (6.273% 
grams per cubic centimetre) is similar to that of Earth, and the 
outer planet is less than half as dense at 2.1*73 grams per cubic 
centimetre, implying that it contains a greater portion of low- 
density components such as water and hydrogen. 

NASA’s Kepler mission has discovered thousands of candidate 
transiting exoplanets, with a wide range of planetary sizes*®. A small 
fraction of these planets have had their masses characterized, either by 
radial-velocity spectroscopy or by transit timing. The latter probes the 
gravitational perturbations between planets in multi-planet systems by 
precisely measuring transit times and fitting dynamical models to the 
observed transit timing variations (TTV)'’. Both radial velocity and 
TTV signals are larger for more massive planets and improve with 
greater planetary masses, although the two techniques sample different 
populations of exoplanets. The radial-velocity technique measures the 
motion of a host star induced by its planet’s gravity, and hence the 
signal declines with increasing orbital distance. 


Table 1 | Stellar and planetary parameters for the Kepler-138 system 


Adopted stellar parameters 


The majority of mass measurements of planets discovered in the 
Kepler data set have been the results of radial velocity observations. For 
Kepler-discovered planets characterized as rocky by this method, the 
orbital periods are all less than a few days. Of the radial-velocity detec- 
tions, Kepler-78b has the lowest mass (1.7M@, where Mg is the 
Earth’s mass), and the shortest orbital period (0.35 days)’*. 

Characterizing planets by transit timing is quite complementary to 
using radial velocity because transit timing is very sensitive to pertur- 
bations between planets that are closely spaced or near orbital reso- 
nances*’. We note that most systems with detected TTVs are not in 
resonance, but rather are near enough to resonance for the perturba- 
tions to be coherent for many orbital periods, while also far enough 
from resonance that the planetary conjunctions cycle around the orbit 
plane within the four-year Kepler baseline. For near-resonant pairs, 
the TTVs of neighbouring planets frequently take the form of anti- 
correlated sinusoids over many orbits'’* or the sum of sinusoids 
where a planet is perturbed by two neighbours’*. The majority of 
TTV detections have been found near first-order mean-motion 
resonances, such that planet pairs have an orbital period ratio near 
to j:j —1, where j is an integer. Near first-order resonances generally 
cause stronger TTV signals than second-order resonances, although 
much depends on how close the planet pairs are to resonance and how 
eccentric (non-circular) their orbits are, as well as the masses of the 
planets. Eccentricity causes the orbital speed to vary during the orbit, 
and the distance between the planets at conjunction to vary with the 
position of the conjunction. 

The bulk of planets with mass characterizations from the Kepler 
sample using TTVs so far have orbital periods ranging from about 10 
days to 100 days. These generally have low density’** and probably 
possess deep atmospheres, with the exception of the rocky planet 
Kepler-36 b”’. 

Kepler-138 (formerly known as KOI-314) hosts three validated 
transiting planets”®. The orbital periods of Kepler-138’s three planets 
are given in Table 1. Kepler-138 c and Kepler-138 d orbit near a 


Stellar mass, My 
(0.521 + 0.055)M5 


Dynamical modelling parameters 


Stellar radius, Ry 
(0.442 + 0.024)R, 


Effective temperature, Terr 


3,841 + 49K 


Stellar density, px 
9.5 +2.2gcm 3 


ecosm 
es +0.096 
0.011 +9098 


Metallicity, [Fe/H] 
—0.280 + 0.099 


esinw 
2 +0.075 
0.0247 5732 


logig (cm s~*)] 
4.886 = 0.055 


(M/Me) X (Mo/My) 
0.131912 


Planet Period (days) To (BJD — 2,454,900) 
b 10.3126 +9 0008 788.4142 + 6.0087 
c 13.7813 0000) 786.1289 + 50008 
d 23.0881 To boee 796.6689 + 5.0013 
Adopted physical characteristics 

Planet R/Rxe Mp (Me) 

b 0.01084 6.0008 0.066+6-089 

c 0.0247 +000. 1.970+1-934 

d 0.0251 +8-0007 0.640+*8-874 


—0,015+0:086 —0.020+0:064 3.85+3.27 


0.126 0.117 2.30 
0.037 +8.080 0,057 +9874 1.284136 
Rp (Re) Density (g cm~3) Incident flux relative to Earth 
0.522 90% 2.624 681/984 
11977387 62i3s 4.63739 
1.212+3-078 21+ 2.327023 
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Figure 1 | Transit timing variations of the three planets orbiting Kepler- 
138. In black are the differences between measured transit times and a 
calculated linear fit to the transit times, with 1o uncertainties shown as error 


second-order mean-motion resonance (5:3), while Kepler-138 b and 
Kepler-138 c orbit near the 4:3 first-order resonance. 

Using transit times up to the fourteenth quarter of the Kepler mis- 
sion, two of the three known planets orbiting Kepler-138”° have been 
confirmed and characterized with TTVs”’. The derived parameters in 
that work suggested that the outer planet, Kepler-138 d, has a density 
so low that it must have a substantial hydrogen/helium gaseous 
envelope”’. 

Using the complete Kepler data set for Kepler-138, we have detected 
TTVs for all three planets (see Fig. 1). We describe our procedure for 
measuring transit times in the Methods, and list our transit time mea- 
surements in Extended Data Table 1. TTVs are expressed as the dif- 
ference between the observed transit times and the calculated linear fit 
to the transit times. Modelling Kepler-138 as a three-planet system, we 
measure the masses of all three planets, the super-Earth-sized Kepler- 
138 c and Kepler-138 d, as well as Kepler-138 b, which at 0.52R@ is 
roughly the size of Mars. 

We performed dynamical fits by calculating the orbits of the three 
planets around the star. We modelled the orbits as being co-planar, 
given that all three planets are transiting, that Kepler’s multi-planet 
systems are known to have small mutual inclinations, and that we 
demonstrate that allowing mutual inclinations has little effect on our 
results for the planet masses (see Extended Data Fig. 7c). 

Our model parameters for each planet are the orbital period P, the 
time To of the first transit after our chosen epoch, the components of 
the eccentricity vector (ecos and esinw, where ~ is the angle between 
the sky plane and the orbital pericentre of the planet), and M,/M,, the 
mass of the planet relative to the host star, which we express as (M,/ 
Meg)(Mo/M,) throughout, where Me is the mass of the Sun. We 
perform Bayesian parameter estimation for these 15 model parameters 
using differential evolution Markov Chain Monte Carlo analysis. We 
report the properties of the star and planets in Table 1, and details of 
the parameter estimation algorithm, priors and statistical models in 
the Methods. 

We measure the mass of Kepler-138 b to be 0.0667*$-)35M @, where 
uncertainties denote the 68.3% confidence limits. The 95.4% interval 
spans 0.011Mq to 0.170M@. The robustness of this result against 
outlying transit times and mutual inclinations is demonstrated in 
the Methods. 

The posterior probability for the inner planet having non-zero mass 
is between 99.82% and 99.91% (depending on the choice of prior for 
eccentricity e), that is, equivalent to a 30 detection. This calculation is 
based on the Savage—Dickey density ratio for calculating the Bayes 
factor, which fully accounts for posterior width and shape, including 
asymmetries and non-Gaussianity, as described in Methods. 

Kepler-138 b is by far the smallest exoplanet, both by radius and 
mass, to have a density measurement (see Fig. 2). Thus it opens up a 
new regime to physical study. It is likely to become the prototype for a 
class of small close-in planets that could be common. The prospect of 
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bars. Grey points mark the difference between the simulated transit times based 
on the best-fit dynamical model and a linear fit to the transit times. a, TTV of 
Kepler-138 b; b, TTV of Kepler-138 c; c, TTV of Kepler-138 d. 


further constraints on this planet are excellent, because NASA’s 
Transiting Exoplanet Survey Satellite should be able to measure transit 
times for the two largest planets, improving constraints on the dynam- 
ical model for all three planets. The European Space Agency’s Plato 
mission will continue this process and ground-based measurements 
may also be possible. These future observations, plus more accurate 
stellar classification using the distance to the star measured by the 
European Space Agency’s Gaia mission, will further improve the char- 
acterization of this system, especially the inner planet. 

Our measurements of the mass and density of the small inner planet 
Kepler-138 b are consistent with various compositions and formation 
locations. If future observations imply that the planet is less dense than 
rock, then the only physically and cosmogonically plausible low- 
density constituents are water and other astrophysical ices, which 
could only have condensed far from the star. That would be the first 
definitive evidence for substantial inward orbital migration of a small 
planet. 

For the two outer planets, Kepler-138 c and Kepler-138 d, we find a 
lower mass ratio between these two planets than does previous work’. 
This is not surprising, since Kepler-138 b’s perturbations explain part 
of the TT Vs observed in Kepler-138 c, which were previously attrib- 
uted solely to perturbations by Kepler-138 d. Nevertheless, the mass 
ratios between each of these planets and their host star remains con- 
sistent with published results”. We find higher densities for both of 
these planets than does previous work, owing to our improved stellar 
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Figure 2 | Mass-radius diagram of well characterized planets smaller than 
2.1 Earth radii, Rg. Prior exoplanet characterizations and 1o uncertainties are 
shown as grey points” *!>"°"??5°°, Black points from left to right are Mercury, 
Mars, Venus and Earth. Red data points are our results for Kepler-138. 

Open circles mark previously measured masses for Kepler-138 c and Kepler-138 
d*'. Error bars mark published 1¢ uncertainties for the planets of Kepler-138 
and masses and radii of all other characterized exoplanets within this size 
range. The curves mark bulk densities of 1 g cm °, 3 gcm * and 10 gcm °. 
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properties, particularly the higher stellar density and consequent smal- 
ler stellar radius. 

Previous estimates of the size and mass of the outer planet Kepler- 
138 d implied that the planet possessed a hydrogen-rich atmosphere”', 
which is difficult to explain with our current understanding of the 
accretion and retention of light gases from low-mass planets orbiting 
close to their star****. Our new measurements could be explained by a 
composition of rock and water. A planet made of rock and water would 
be more stable against mass loss, and would imply that the planet 
formed at a greater distance from the star and migrated. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


We have used all available short-cadence Kepler data and long-cadence data 
wherever short-cadence data are unavailable to complete the data set for 17 quar- 
ters. We list the transit times for each planet in Extended Data Table 1. 
Throughout, we express times since Barycentric Julian Day (BjD) minus 2,454,900. 
Photometric transit and stellar models. From the light curve, we filtered instru- 
mental and astrophysical effects that are independent of planetary transits. To 
each segment of the photometric time series, we fitted a cubic polynomial of width 
2 days, centred on the time of each measurement”. We excluded measurements 
taken within one transit duration (defined as the time from first to last contact) of 
the measured centre of the transit and extrapolate the polynomial to estimate 
corrections during transits. This process strongly filters astrophysical signals with 
timescales of approximately 2 days, which could affect the shape of a planetary 
transit. We also excluded measurements for which the associated segment has gaps 
longer than 2.5 h. 

We fitted the detrended Kepler light curve using a transit model for quadratic 
limb-darkening” and non-circular Keplerian orbits. We stacked transits of each 
planet with corrections for the measured TTVs”’. To account for Kepler’s obser- 
vation cadence, we averaged our transit model with 11 equal spacings within the 
1 min or 30 min integration window. We evaluated the photometric noise for each 
quarter of data to fit transit models, adopting published stellar parameters for 
Kepler-138**. We adopted a two-parameter quadratic model for limb-darkening 
with fixed coefficients (0.3576, 0.3487) appropriate for Kepler’s bandpass and 
Kepler-138’s effective temperature (T.), log[g] and metallicity [Fe/H]*’. 

The light curve model parameters consist of the mean stellar density** py, a 
photometric zero point for each light curve segment, and for each planet the orbital 
period, time of transit, planet-to-star radius ratio, impact parameter, and eccent- 
ricity parameterized as ecosw and esinw. We determined posterior distributions of 
our model parameters using Markov Chain Monte Carlo techniques”’. Our best-fit 
transit models, shown in Extended Data Fig. 1, resulted in consistent estimates for 
P» from each planet. 

We determined the mass and radius of Kepler-138 by fitting the spectroscopic 
parameters (Tr, [Fe/H])* and our light curve contraints of p4 to Dartmouth 
Stellar Evolution models**, assuming a Gaussian probability density for each para- 
meter™. For the Dartmouth models, we varied initial conditions of mass, age, and 
[Fe/H] and interpolated over a grid to evaluate To px, and [Fe/H] for any set of 
initial conditions. We computed posteriors using Markov Chain Monte Carlo 
analysis to obtain stellar model-dependent posteriors on My and Ry. Table 1 lists 
our adopted stellar parameters. We tested the effects of eccentricity priors on our 
measurement of p4, adopting a uniform prior in eccentricity as our nominal 
results. We compare results with eccentricity fixed at zero in Extended Data 
Fig. 2. Although a uniform prior on eccentricity results in a slightly wider range 
of inferred radii for the star, both of these models are consistent with the spectro- 
scopic study of Kepler-138°. 

Our solution for the impact parameter of the middle planet is 0.3 + 0.2, much 
lower than the previous estimate of 0.92 + 0.027. The apparent U-shaped 
transit for Kepler-138 c (Extended Data Fig. 1) is consistent with a low impact 
parameter. Our measured impact parameter for Kepler-138 d is 0.810 + 0.057, 
consistent with the previous measurement”’. Our revised impact parameters imply 
Px = 9.0 + 1.9 gcm *, which agrees with our transit models for each planet, and 
with the spectroscopic study of Kepler-138°. 

We find that Kepler-138 has a smaller mass and radius than previous estimates 
based on the absolute K-band magnitude (Mx) of the star from high-resolution 
Keck spectra*®. These relied on mass-luminosity relations” and a mass-radius 
relation from interferometry**. However, the calibration stars used to correlate the 
Mx to spectral index excluded cool stars that are active. In the case of Kepler-138, 
the photometric time-series exhibits large (1%) variations due to starspots. These 
increase the risk of systematic errors in the measurement of stellar luminosity, and 
therefore the stellar properties derived from the mass-luminosity relation. 

The time-scale for star-spot modulation, ~20 days, was much longer than the 

transit duration, and was probably dominated by two spots. We found no evidence 
of starspot crossings, nor did we find any TTV periodicities related to the rotation 
period of the star. Hence, the stellar activity is unlikely to affect our transit model 
or transit times. 
Analytical constraints from TTV. Orbital period ratios determine how close 
planets are to mean motion resonance, and over what period TTVs are expected 
to cycle. The inner pair of planets of Kepler-138 orbit near the 4:3 resonance with an 
expected TTV cycle of 1,570 days, slightly longer than the 1,454-day observational 
baseline of Kepler-138 b’s transits. We fitted a sinusoid at this periodicity to the 
TTVs of the inner planet, and detected a TTV amplitude of 34 + 4 min. This permits 
a rough estimate for the mass’? of the middle planet of (6.8 + 0.9)Mo(Mo/Mx); 
which is close to our final measure of the mass of the middle planet. 


The detection of TTVs at Kepler-138 b imply that the TTVs in Kepler-138 c 

have a component caused by Kepler-138 b. However, the middle planet’s TTVs are 
the combined effect of perturbations from its two neighbours Kepler-138 b and 
Kepler-138 d, and the outer pair orbit near a second-order mean motion resonance 
for which there is no known analytical model. Hence, the masses of the inner and 
outermost planets cannot be estimated by fitting such a simplified sinusoidal 
model to the TTVs. 
Detailed TTV modelling. For each set of initial conditions, we calculate the 
transit times of all three planets based on Newtonian gravity, using an eighth- 
order Dormand-Prince Runge-Kutta integrator'>!**?. We compare the simulated 
and observed transit times for each planet assuming each observed transit time has 
an independent Gaussian measurement uncertainty. 

We found an excess of outlying transit times to our model, where either instru- 
mental effects or stellar activity led to a few unlikely transit times with under- 
estimated uncertainties. Extended Data Fig. 3 shows the distribution of residuals to 
our best-fit TTV model compared to a Gaussian distribution, revealing these 
outliers. Of the 257 measured transit times, five are outliers, where for simulated 
transit times S, and measurement uncertainties oy, (O — S)/orr > 3. We removed 
these outliers, and used the 252 remaining measurements as our nominal data set 
for dynamical models. Later, we tested our results for robustness against outliers. 

Performing extensive grid searches and Levenberg-Marquardt analysis, we 
found a single region of high posterior probability including multiple closely 
related local minima. To characterize the masses and orbital parameters of the 
three planets, we performed Bayesian parameter estimation using a differential 
evolution Markov Chain Monte Carlo algorithm**'. We used Metropolis- 
Hasting acceptance rules on 45 ‘walkers’ exploring parameter space in parallel, 
where for each walker, a proposal is a scaled vector between two other walkers 
chosen at random. Using differential evolution for the proposal steps increases the 
probability that proposals will be accepted, particularly for target distributions 
with strong correlations between model parameters. 

The walkers were launched near the best-fit model found by Levenberg- 
Marquardt analysis. We updated the vector scale length factor every 15 genera- 
tions to keep the acceptance rate near the optimum value of 0.25*° and thinned 
the data by recording every 20th generation in the Markov Chain. We discarded 
the first 50,000 generations and continued the Markov Chain for 1,250,000 addi- 
tional generations. 

The mean correlation between parameter values in the same Markov Chain 
Monte Carlo chain for a given separation in the chain begins near unity for 
consecutive generations, and declines for greater separations’. We used this 
property to assess how well-mixed our Markov Chain Monte Carlo chains were. 
The autocorrelation length, defined as the lag between generations required for the 
correlation to fall below 0.5, varied between the walkers, averaging 9,000 genera- 
tions for the planet-to-star mass ratio of Kepler-138 b. Hence, the mean length of 
the differential evolution Markov Chain Monte Carlo chains was about 139 auto- 
correlation lengths. 

For each planet, we adopted a uniform prior in orbital period, To, e, and @. For 
M,/Mx, we adopted a uniform prior that allowed negative masses to enable a 
simple estimate for the significance of a positive mass for Kepler-138 b from the 
posterior samples*’. For Kepler-138 b, the mass is greater than zero in 99.84% of 
posterior samples. As a further test, we considered an alternative, more realistic, 
uniform prior where planetary masses are positive definite and limited to the mass 
of a pure iron planet (given their sizes)***. 

Posteriors for the mass ratios of each planet to the host star, and each eccent- 

ricity vector component are shown in Extended Data Fig. 4. We list 68.3%, 95.4%, 
and 99.7% credible intervals for all parameters in Extended Data Table 2. We also 
include the mass ratios between the planets and relative eccentricity vector com- 
ponents, which are more constrained by the data than absolute masses and eccent- 
ricities. 
Orbital eccentricity. Joint posteriors for the eccentricity vector components are 
displayed in Extended Data Fig. 5. These show extreme correlations because a 
broad class of orbital parameters in which there is a precise apsidal alignment of 
orbits satisfies the data. 

We performed long-term integrations on a subset of our posterior planet masses 
and orbital parameters including a wide range of eccentricities using the HNBODY 
code’. We investigated whether long-term stability could further constrain the 
eccentricities. A sample of the solutions were integrated for ten million orbits and 
all were found to be stable. Integrating one of the best-fit solutions with high 
eccentricities for Kepler-138 b, Kepler-138 c and Kepler-138 d of 0.23, 0.20 and 
0.18 respectively, we confirmed that the orbits were stable for over one billion 
years; the apsidal lock was maintained with the periapses of all three planets closely 
aligned. Although rare in the Solar System, apsidal alignment has been observed 
and studied in the Uranian ring system*”“* and it has been detected in exoplanetary 
systems, including v Andromedae”, GJ 876°° and possibly 55 Cancri”’. 
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Significance of mass detection for Kepler-138 b. We establish the statistical 
significance of a non-zero mass for Kepler-138 b by computing the Bayes factor, 
that is, the ratio of the marginalized posterior probability for a model with the mass 
of planet Kepler-138 b fixed at zero relative to the marginalized posterior prob- 
ability for a model where all three planets have a non-zero mass. Intuitively, the 
Bayes factor quantifies how much the transit timing data has increased our con- 
fidence that Kepler-138 b has a non-zero mass. When performing Bayesian model 
selection, it is essential to choose proper (that is, normalized) priors for any 
parameters not occurring in both models. For the mass of Kepler-138 b we adopt 
a uniform prior ranging between zero mass and the mass of an iron sphere the size 
of Kepler-138 b. We tested two models of iron planets as upper limits on our mass 
priors***. We compute the Bayes factor using the generalized Savage-Dickey 
density ratio based on the posterior samples from our nominal model”. We find 
that the three-massive-planets model is strongly favoured, with the posterior 
probability for the three-massive-planets model equal to 99.82%** (99.80%**) for 
our nominal model (that is, three massive planets, each with a uniform eccentricity 
prior) or 99.91% (99.90%**) for a model with three massive planets, each with a 
Rayleigh distribution (o = 0.02°°) for an eccentricity prior. A more restrictive prior 
for the mass of Kepler-138 b would further increase the posterior probability for 
the three-massive-planets model. 

The generalized Savage-Dickey density ratio is superior to more commonly 
used substitutes (such as the Akaike information criterion, AIC, or the Bayesian 
Information Criterion, BIC), since the Savage-Dickey density ratio provides a 
practical means of calculating the Bayes factor, the actual quantity of interest for 
rigorous Bayesian model comparison. The AIC and BIC use only the likelihood of 
the two best-fit models and do not account for the width or shape of the posterior 
probability distributions. Extended Data Fig. 4a shows that the marginal posterior 
for the mass of Kepler-138 b is asymmetric and non-Gaussian, so the AIC or BIC 
would be a particularly poor choice for our problem. 

Therefore, we have computed the rigorously correct Bayes factor using the 
Savage-Dickey density ratio, which provides an efficient way of calculating the 
Bayes factor when comparing two nested models, meaning that the simpler model 
is equivalent to the more general model when the additional parameters (0) take 
on a particular value (0 = 09). In our case, this occurs when My = 0. One 
advantage of the Savage—Dickey density ratio over computing fully marginalized 
likelihoods is that the Savage-Dickey density ratio can be computed from the 
posterior for the more general model. This is computationally practical when 
comparing models that differ by one to a few dimensions, since then the posterior 
at 0 can be computed from a posterior sample using a kernel density estimator. 
We estimate the posterior density using a Gaussian kernel density estimator with 
bandwidth 0.001M@(M,/Mo), which was found to be optimal when analysing 
synthetic posterior samples data. We verified that the results were insensitive to 
varying the choice of bandwidth by an order of magnitude. 

Sensitivity analyses. We performed several tests to assess the robustness of our 
results to: (1) the choice of prior for eccentricity, (2) the treatment of transit time 
outliers, (3) the assumption of co-planarity, and (4) our algorithm. For these 
sensitivity analyses, we adopt our nominal prior for planet masses to allow for 
negative planet masses. While any such models are clearly unphysical, allowing for 
such models offers an efficient and intuitive means for evaluating whether the 
lower limit on the planet masses is robust to the above assumptions. 

Choice of eccentricity prior. As noted above, our nominal model has a uniform 
prior in eccentricity. The joint posterior for the mass ratio of Kepler-138 b to the 
host star and its orbital eccentricity is shown in Extended Data Fig. 5. We show this 
plot with two alternative priors in eccentricity: a Rayleigh distribution with a scale 
length 0.1°*, and a more constrained one, consistent with Kepler’s multi-planet 
systems, with a scale length of 0.02°°. Since the data constrain the eccentricity so 
weakly, the choice of priors strongly affects the posteriors of eccentricity. However, 
the planet-to-star mass ratios were much more weakly affected by the choice of 
eccentricity prior, as shown in Extended Data Fig. 7a. 

Because the apsidally locked solutions are long-term stable, we adopt the uniform 
prior as our nominal solution, and note that a more constraining prior on eccent- 
ricity results in a marginally wider posterior for mass ratio of Kepler-138 b to the 
star. Although, as noted above, the orbital eccentricities in the system are only 
weakly constrained, the relative eccentricities and mass ratios between the planets 
are tightly constrained by the TTVs, as shown in Extended Data Fig. 6 and 
Extended Data Table 2. 

Transit time observation outliers. To assess the effect of residual outlying transit 
times, we repeated the analysis with two other sets of measured transit times, 
differing only in that we removed transit time residual outliers beyond (1) 40 
(leaving 254 transit times) or (2) 2.50 (leaving 244 transit times), as opposed to 
our nominal data set, which excluded 3¢ outliers. Extended Data Fig. 3 shows the 
relative frequency of residuals compared to a Gaussian. The majority of outliers are 
between 2.50 and 30. We excluded outliers beyond 4¢ from all models. 
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The posterior for the mass of Kepler-138 b for each of these three data sets is 
displayed for comparison in Extended Data Fig. 7b. Overall, the outliers have a 
modest effect on the measured mass ratio for Kepler-138 b to the host star. Since 
most of the outliers are in the transit times of Kepler-138 b, these have a small effect 
on our mass measurement of Kepler-138 b. Furthermore, the mass of Kepler-138 c 
is constrained by its effect on the transit times of both planets Kepler-138 b and 
Kepler-138 d. Nevertheless, we note that the inclusion of more outliers increases 
the skewness of the posterior for planetary mass and causes the mode of the 
distribution to shift to a slightly lower mass. 

Assumption of co-planarity. The known low inclination dispersion amongst 
Kepler’s multiplanet systems makes co-planarity a reasonable assumption for 
TTV modelling’*****. Furthermore, geometric considerations make multi-planet 
transiting less likely to be observed for systems with large mutual inclinations. 
Nevertheless, we tested the effect of mutual inclinations on our solutions. We 
performed two additional sets of simulations: (1) with the longitude of the ascend- 
ing node of Kepler-138 b as a free parameter, leaving the other two planets as co- 
planar, and (2) with a free ascending node for Kepler-138 d. In each case, we 
adopted a uniform prior for the ascending node. In Extended Data Fig. 7c, we 
compare the posteriors for the mass ratio of Kepler-138 b to the host star with 
mutual inclinations to our nominal result. The nominal result gives a consistent, 
but slightly wider posterior than with free ascending nodes. 

Tests with synthetic transit times. We evaluated our method by generating 
synthetic data sets of transit times with known planetary masses and orbital 
parameters to test how well the input parameters were recovered. The results 
are shown in Extended Data Fig. 8. We generated synthetic transit times based 
on the median values of each parameter from the marginal posteriors of our 
nominal model and added Gaussian noise to each observation with a standard 
deviation equal to the measured timing uncertainties. Our analysis of the simu- 
lated data results in a posterior for the mass of Kepler-138 b consistent with the 
true values. While the resulting posterior for the mass of Kepler-138 b shows a 
slightly higher mode and is less skewed than the posterior for the nominal model, 
the differences are comparable to the minor effects of transit timing outliers or 
choice of eccentricity prior. This result validates our both our transit timing 
method and our TTV analysis. 

Additionally, we generated eight independent synthetic data sets with zero mass 

for Kepler-138 b and other parameters based on our nominal model (Extended 
Data Fig. 8). In all eight cases, the posterior probability for Kepler-138 b’s mass was 
insignificant, with only 16% to 86% of posterior samples having a mass greater 
than zero, consistent with expectations for non-detections. This is in sharp con- 
trast to our analysis of the actual data, which results in 99.84% of the posterior 
yielding a positive mass for Kepler-138 b. 
Planetary characteristics. Our adopted credible intervals in planetary mass and 
density were calculated by repeatedly multiplying samples from the posteriors of 
planet-to-star mass ratios, and My. Uncertainties in planetary radii were calcu- 
lated with the fractional uncertainty in the stellar radius and the uncertainty in 
planet-to-star radius ratio added in quadrature. Time-averaged incident flux for 
each planet compared to the Earth was calculated in the low eccentricity limit, 
although we note that if the orbits were highly eccentric, the fluxes would be 
marginally higher. 
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Extended Data Figure 1 | Folded light curves with corrections for observed 
TTV for Kepler-138. The scattered points are photometric relative fluxes and 


the curves are analytical models of the transit shape described in the text. 
a, Kepler-138 b; b, Kepler-138 c; c, Kepler-138 d. 
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Extended Data Figure 2 | Stellar mass and radius models using constraints 
on the stellar mean density inferred from the light curve. In cyan are models 
that adopted a uniform prior in eccentricity, and in magenta are constraints 
found with orbital eccentricities fixed at zero. Grey points with error bars mark 
stellar parameters found in the literature while the black error bars mark our 
adopted solution for stellar mass and radius. 
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Extended Data Figure 3 | The distribution of residual normalized 
deviations from our best fit dynamical model to the raw transit times. The 
histogram marks deviations calculated as: (O — S)/o-ry, where O is the observed 
transit time, S is the simulated transit time and ory is the measurement 
uncertainty. The green curve marks a Gaussian distribution. 
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Extended Data Figure 5 | Joint posteriors of model parameters and the 
effects of eccentricity priors. The dark (light) grey marks the 68.3% (95.4%) 
credible intervals for each joint posterior. a-c, M,/M, and eccentricity vector 
components for the inner and middle planets. d-f, M,/M,, and eccentricity 


(Mp/Ma)(M./Mx) 


(Mp/Mz)(M/My) 


vector components for the middle and outer planets. Panels g, h and i compare 
Kepler-138 b’s M,/M, and its orbital eccentricity, for three eccentricity priors: 
a uniform prior on eccentricity (g), and models with a Rayleigh distribution of 
scale factor 0.1 (h), and 0.02 (i). 
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relative eccentricities between planets. The mass ratio of the inner and middle _ eccentricity vector components—that is, the difference in ecosw (esinw) in the 
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Extended Data Figure 7 | Sensitivity tests for the effects of eccentricity 

prior, outlying transit times and free inclinations on the mass of Kepler-138 
b relative to the host star. Panel a compares a uniform prior (black curve, our 
nominal posterior for all comparisons) and a Rayleigh distribution with scale 
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factors 0.1 (navy) and 0.02 (cyan). Panel b compares posteriors with 30 outliers 
excluded (black), with two alternatives: 4c outliers (blue) and 2.5 outliers 
removed (light green). Panel c compares our nominal model with one with a 
free ascending node for the inner (purple) or outer (red) planet. 
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Extended Data Figure 8 | Validation of our method with synthetic data sets. _ our claim for a positive mass detection for Kepler-138 b. The magenta and 
The green curve marks the posterior for a synthetic data set generated with the _ purple shades are posteriors for models using data generated with zero mass for 
same parameters as were the medians of our nominal posteriors (in Table 1). Kepler-138 b. These zero-mass synthetic models all reproduced non- 

The agreement between the green and black curves validates our method and __ detections. 
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Extended Data Table 1 | Transit times of Kepler-138’s planets 


Kepler-138 b 
56.23045-+40.01261 
66.55886-40.01962 
76.8641440.01211 
87.16890-0.01670 
97.51565-£0.01126 
107.76610+0.01772 
118.17035-+0.02111 
128.43371 0.02127 
138.68975-£0.01501 
149.08968£0.02329 
159,35323+0.01381 
169.6747140.02554 
179.95288+0.01453 
190.29561-+0.03001 
200.53153-0.05541 
210.92867+0.02677 
221.21582+0.01553 
231.53491+0.02127 
241.8643140.02522 
252.1638240.01926 
262.47265-+0.02855 
272.73469-+40.02464 
293.42230-+0.01085 
314.03280-+40.01904 
324.3552440.01297 
334.66018+0.01269 

345.0224940.01162* 
355.289500.01491 
365.58461+0.01360 
386,22924+0.01392 
396.5372140.01443 
406.84333L0.01218 
417.16572£0.01807 
427.47777£0.01928 
437.7762340.01316 
448.06880-L0.01816 
458.42534£0.01954 
468.71871£0.00985 
479.06107£0.01195 
489.32477£0.01240 
509.97820-+40.01444 
520.29576+0.01286 
530.64900+0.01046* 
540.89551+0.01345 
551.21595£0.01603 
561.5313140.01220 
571.8404640.01392 
582.17781+0.01054 
592.47561+0.01480 
602.78923+0.01425 
613.08427+0.01595 


633.73803+0.01076 
674.96352£0.01399 
705.895330.02944 
716.2233840.01418 

726.53871+0.01875 

747.16800+0.01297 

757 44480+0.00921* 
767.76381+0.01237 

788,40859-+0.01624 
798.73107 0.01595 

809.02624+0.01691 

819,36067 0.01339 
829.65344+0.00945 

840.01903-40.02614 
850.30643-L0.02219 
860.61829+0.02166 
870.907350.01757 

881.213710.02906 
891.58748+0.01517 

901.831 11+0.02494 
912.1686] +0.04122 

922.61657+0.01640* 
932.78055+0.02060 
943.12746+0.02492 

953.41465+0.01273 

963.762980.01606 

974.06380-£0.01566 
984,36675+0.01611 

994.701830.01800 
1015.35516+0.01660 
1025.61860+0.01477 
1056.56881 0.01466 
1066.88059+0.01294 
1077.19053-+£0.01288 
1108.14741+0.01024 
1128.80557 0.01898 
1139.08576+0.01882 
1149,39509+0.02059 
1170.02484+0.01307 
1180,34646+0.01704 
1190.66161+0.01759 
1200.98622+0.01705 
1221.60140+0.01428 
1231.90599+0.01844 
1242.23411+£0.01937 
1252,57897 0.02116 
1262.84891 +0.02335 
1273.19155+0.01717 
1283.49165+0.01618 
1293.77245+0.01799 
1304.09485+0.01510 
1314,43639+0.01590 


1324.71431+0.01427 
1335.04216+0.01297 
1345.36064+0.02184 
1355.68308+0.01380 
1365.99 1070.01877 
1376.29867+0.01476 
1386.618210.01230 
1396.93177+0.01012 
1407.23423+0.01422 
1427.86777£0.01784 
1438.17099+0.01347 
1448.500212-0.01257 
1469. 12674+0.01137 
1479.43924+0.01741 
1489.75786+0.01432 
1500.06004--0.01604 
1510.397430.01074 
1520.70942+0.01625* 
Kepler-138 c 
55.73273£0.00208 
69.50883 0.00225 
83.29050+0.00217 
110.85343+0.00484 
124.63503+0.00333 
138.42452+0.00358 
152.19210+0.00315 
165.97408.0.00308 
193.54290+.0.00687 
221.09921+0.00286 
234.87885+0.00353 
248.65986+0.00433 
262.44586+0.00462 
276.22920+0.00397 
290.00336+0.00294 
317.56657£0.00224 
358.90489+0.00228 
372.69242+0.00221 
386.4739940.00224 
414.039360.00188 
427.81517.0.00240 
469.16591+0.00245 
482.93674+0.00261 
496.72271 0.00322 
510.49745+0.00182 
524.28649+-0.00244 
538.06429+0.00185 
551.84901+0.00219 
579.40750+0.00221 
593.18526+0.00196 
606.97 103+0.00241 
620.754360.00195 
634.533080.00183 


648.31255-0.00155 
675.875190.00283 
689.655120.00208 
717.2257540.00237 
731.00161+0.00295 
772.34908£0.00190 
786.13380+0.00298 
799.91135+0.00223 
813.689330.00227 
827.47331+0.00161 
841.25609-0.00213 
855.03878-£0.00336 
882.59698-++0.00361 
896.37408-£0.00312 
910.15919-0.00428 
923.9451 1 0.00405 
951.50290-0.00319 
979.06837 +0.00273 
992.84600-£0.00482 
1006.62904-+0,00238 
1020.40638-+0.00267 
1034.18777 0.00273 
1047.97221+0.00238 
1061.75267£0.00194 
1075.53011+0.00193 
1089.31318+0.00199 
1103.09090-£0.00304 
1116.87516+0.00212 
1130.65224+0.00246 
1144.43817£0.00307 
1158.20412+0.00472 
1171.99592+0.00255 
1185,78181 0.00356 
1199,56134+0.00251 
1213.34319-+0.00211 
1240.89572+0.00416 
1254.681130.00281 
1268.46175+£0.00161 
1282.24807+0.00185 
1296.02377+0.00257 
1309,80228-+0.00198 
1323.58892+0.00166 
1364,92964-+0,00252 
1378.71091+0.00204 
1392.49762£0.00311 
1406.27306-+0.00251 
1433.83402+0.00231 
1447.61464+0.00223 
1461.39945+0.00302 
1475.17987+0.00450 
1488.95524+0.00212 
1502.73866+0.00223 


Kepler-138 d 
57.81311+0.00267 
80.90677+0.00212 
127.0836040.00410 
150.17399+0.00382 
173.27102+0.00324 
196.36319+0.00534 
219.45290+0.00346 
242.54570+0.00639 
265.63275+0.00311 
288.72239+0.00210 
311.81585+0.00301 
334,90440+0.00272 
357.9948740.00252 
381.08423-0.00274 
404.17500+0.00242 
427.26468-0.00206 

450.35127+0.00336 
473.440820.00289 
496.53494+0.00178 
519.62090+0.00176 
542.70985+0.00208 
588.88500+0.00221 
611.97165+0.00281 
635.05492+0.00235 
681.23384-£0.00232 
704.32004-0.00276 
727.40385£0.00198 
750.49681+0.00318 
773.58233+0.00193 
796.66645+0.00288 
819.75707+0.00448 
842.84842+0.00305 
889.01363-+0.00268 
935.19781+0.00515 
958.28389-+0.00303 
981.38587-0.00477 
1073.73237 £0.00217 
1142.99351+0.00424 
1166.09435-+0,00502 
1189.18306-+0.00352 
1212.2745340.00191 
1235.36836+0.00368 
1258.45747+£0.00330 
1281.54282+0.00419 
1327.72877£0.00212 
1373.91321-+0.00170 
1396.99052+0,00283 
1443.17833+0.00318 
1466.27118+0.00365 
1489.35757+0.00347 
1512.45493-£0.00353* 


All times are expressed in days since BJD 2,454,900. Estimated uncertainties give 68% confidence limits. Outliers beyond 3c in the residuals of dynamical 
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Extended Data Table 2 | Confidence intervals from distributions found with differential evolution Markov Chain Monte Carlo TTV analysis. 


Param. \ Conf. Int. 68.3% 95.4% 99.7 % 


0.0004 0.0006 —-+0.0007 
P (days) b | 10.3126“) 0006 “0010 0.0014 


+0.0001 +0.0002 +0.0002 
c 13.7813 —0.0001 —0.0002 —0.0004 


+0.0009 +0.0016 +0.0022 
d 23.0881 —0.0008 —0.0012 —0.0013 


+0.0027 +0.0055 +0.0084 
To b | 788.4142 0.0027 —0.0054 —0.0081 


+0.0005 — +0.0010 +0.0015 
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+0.0013 +0.0025 +0.0035 
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cf} 0.02055 hate T8369 
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mere b) (0.129 hi aoe OS 
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d) 1.282385 he TAT 


+0.0136 +0.0284 +0.0447 
M,/M. 0.0353 0.0125 —0.0244 —0.0359 


+0.0167 +0.0389 +0.0530 
CcCOSWe- pCOSW, | 0.0038 750135 — “oo487 “ty O702 


‘ ‘ +0.0202 -+0.0469 »—-+0.0625 
e-Sinw,- €,Sinw, | -0.0033 2013 “y.os70 =0,0608 


° +2.20 +10.66 +17.86 
We— Wh (°) -0.58 2.41 -10.51 -16.78 


+0.4434 +1.0303 +1.7453 
M /Ma 2.9586 —0.3528 —0.6369 —0.8774 


0.0312 +0.0391 +0.1059 +0.1845 


CCOSWe- Cg COSWq 0.0205  -0.0458 0.0664 


By co 40.0322 +0.0874.~—=40.1439 
e-SiNw,- egSinwy | 0.0186 “y5319 = “h0750 = “0°08 
; © 49.11 413.75 +14.62 

We- Wg (°) 3.31 -15.30 20.34 21.24 


We include the parameters of our dynamical fits, as well as the mass ratios and relative eccentricity vector components between the planets, which have tighter constraints than the absolute masses or eccentricity 
vector components. 
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A permanent, asymmetric dust cloud around 


the Moon 


M. Horanyi''*, J. R. Szalay*, S. Kempf", J. Schmidt*, E. Griin’*, R. Srama® & Z. Sternovsky!*” 


Interplanetary dust particles hit the surfaces of airless bodies in the 
Solar System, generating charged’ and neutral’ gas clouds, as well 
as secondary ejecta dust particles*. Gravitationally bound ejecta 
clouds that form dust exospheres were recognized by in situ dust 
instruments around the icy moons of Jupiter* and Saturn’, but 
have hitherto not been observed near bodies with refractory rego- 
lith surfaces. High-altitude Apollo 15 and 17 observations of a 
‘horizon glow indicated a putative population of high-density 
small dust particles near the lunar terminators®’, although later 
orbital observations*” yielded upper limits on the abundance of 
such particles that were a factor of about 10* lower than that neces- 
sary to produce the Apollo results. Here we report observations of a 
permanent, asymmetric dust cloud around the Moon, caused by 
impacts of high-speed cometary dust particles on eccentric orbits, 
as opposed to particles of asteroidal origin following near-circular 
paths striking the Moon at lower speeds. The density of the lunar 
ejecta cloud increases during the annual meteor showers, especially 
the Geminids, because the lunar surface is exposed to the same 
stream of interplanetary dust particles. We expect all airless plan- 
etary objects to be immersed in similar tenuous clouds of dust. 

The Lunar Atmosphere and Dust Environment Explorer (LADEE) 
mission was launched on 7 September 2013. After reaching the Moon 
in about 30 days, it continued with an instrument checkout period of 
about 40 days at an altitude of 220-260 km. LADEE began its approxi- 
mately 150 days of science observations at a typical altitude of 20-100 
km, following a near-equatorial retrograde orbit, with a characteristic 
orbital speed of 1.6 km s_' (ref. 10). The Lunar Dust Experiment 
(LDEX) began its measurements on 16 October 2013 and detected a 
total of approximately 140,000 dust hits during about 80 days of 
cumulative observation time out of 184 total days by the end of the 
mission on 18 April 2014. LDEX was designed to explore the ejecta 
cloud generated by sporadic interplanetary dust impacts, including 
possible intermittent density enhancements during meteoroid 
showers, and to search for the putative regions with high densities of 
0.1-um-scale dust particles above the terminators. The previous 
attempt to observe the lunar ejecta cloud by the Munich Dust 
Counter on board the HITEN satellite orbiting the Moon (15 
February 1992 to 10 April 1993) did not succeed, owing to its distant 
orbit and low sensitivity’. 

LDEX is an impact ionization dust detector (Methods subsection 
“The LDEX instrument’). When pointed in the direction of motion of 
the spacecraft, LDEX recorded average impact rates of about 1 and 
about 0.1 hits per minute of particles with impact charges of q = 0.3 
and q = 4 fC, corresponding to particles with radii of a Z 0.3 um and 
a = 0.7 um, respectively (Fig. 1). Approximately once a week, LDEX 
observed bursts of 10 to 50 particles in a single minute. Particles 
detected in a burst are most likely to originate from the same well- 
timed and well-positioned impact event that happened just minutes 
before their detection on the ground-track of LADEE. Several of the 


yearly meteoroid showers generated sustained elevated levels of LDEX 
impact rates, especially those where the majority of the incoming 
meteoroids hit the lunar surface near the equatorial plane, greatly 
enhancing the probability of LADEE crossing their ejecta plumes. 
The Geminids generated the strongest enhancement in impact rates 
for +1.5 days centred around 14 December 2013. 

The distribution of the detected impact charges remained largely 
independent of altitude, and throughout the entire mission it closely 
followed a power law: p,(q) « q ° * (Fig. 2). This alone indicates 
that the initial mass distribution of the ejecta particles is, to a good 
approximation, independent of their initial speed and angular distri- 
butions (Methods subsection “Dust ejecta clouds’), and that the num- 
ber of ejecta particles generated on the surface per unit time with 
mass greater than m follows a power law: N‘(>m) «x m ”. The 
LDEX measurements indicate « ~ 0.9, surprisingly close to the value 
og = 0.8 suggested by the Galileo mission at the icy moons of Jupiter’? 


Daily average impact rate (per minute) 
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Figure 1 | Impact rates throughout the mission. The daily running average of 
impacts per minute of particles that generated an impact charge of q = 0.3 fC 
(radius a = 0.3 um) and q = 4 fC (radius a 2 0.7 um) recorded by LDEX. The 
initial systematic increase until 20 November 2013 is due to transitions from the 
high-altitude checkout to the subsequent science orbits. Four of the several 
annual meteoroid showers generated elevated impact rates lasting several days. 
The labelled annual meteor showers are: the Northern Taurids (NTa); the 
Geminids (Gem); the Quadrantids (Qua); and the Omicron Centaurids (oCe). 
The observed enhancement peaking on 25 March 2014 (grey vertical line) does 
not coincide with any of the prominent showers. During the Leonids meteor 
shower around 17 November 2013, the instrument remained off due to 
operational constraints. From counting statistics, we determine that the daily 
average impact rate of particles generating a charge of at least 0.3 fC is 1.25 hits 
per minute and, hence, the 1o relative error is about 2%, while for particles 
generating an impact charge > 4 fC the average rate is 0.08 hits per minute and, 
hence, the 1o relative error is about 10%. 
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Figure 2 | Slope of the charge and mass distributions. The exponent of 
the power-law distributions of the impact charges p,(q) « q “ * ® fitted to 
LDEX measurements as functions of altitude (15 km bins) and time (10 day 
bins). The colour indicates the value of the charge distribution index —(1 + «), 
and the size of a circle is inversely proportional to its absolute uncertainty 
(largest circle: +0.1; smallest circle +0.5). The inset shows the impact charge 
distribution for all heights for the entire mission, resulting in a y* minimizing 
fit” of 2 = 0.910 + 0.003. 


and to laboratory experimental results of ejecta production from 
impacts’*. The derived ejecta size distribution also represents the size 
distribution of the smallest lunar fines (very small particles) on the 
surface because most ejecta particles return to the Moon and comprise 
the regolith itself, unless these small particles efficiently conglomerate 
on the lunar surface into larger particles. 

The characteristic velocities of dust particles in the cloud are of the 
order of hundreds of metres per second, which is small compared to 
typical spacecraft speeds of 1.6 kms. Hence, with the knowledge of 
the spacecraft orbit and attitude, impact rates can be converted directly 
into particle densities as functions of time and position. This approach 
is expected to result in a relative error <20%, on the basis of a com- 
plete ejecta cloud model’? (Extended Data Fig. 2). Both the derived 
average number density as a function of height, and the initial speed 
distribution match expectations only for altitudes h = 50km 
(Extended Data Fig. 3). This indicates that, for the lunar surface, the 
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Figure 3 | Lunar dust density distribution. a, The top-down view of the dust 
density n(a = 0.3 um) projected onto the lunar equatorial plane. While pointed 
near the direction of the motion of the spacecraft, LDEX did not make 

measurements between 12 and 18 Lr. White colouring indicates regions where 
LADEE did not visit or was not set up for normal operations. b, The density as a 
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customary assumption ofa simple power-law speed distribution with a 
single sharp cut-off minimum speed uy needs revision for speeds below 
about 400 ms '. At higher values the speed distribution follows a 
simple power law (Extended Data Table 1), as predicted by existing 
models’*. An ejecta plume opening cone angle of Wo ~ 30° is consist- 
ent with our measurements, including those taken during the observed 
bursts of impacts. The average total mass of the dust ejecta cloud is 
estimated to be about 120 kg, approximately 0.5% of the neural gas 
atmosphere™. 

We found that the density distribution is not spherically symmetric 
around the Moon (Fig. 3), exhibiting a strong enhancement near the 
morning terminator between 5 and 7 h local time (11), slightly canted 
towards the Sun from the direction of the motion of the Earth-Moon 
system about the Sun (6 LT). The observed anisotropy reflects the 
spatial and velocity distributions of the bombarding interplanetary 
dust particles (Extended Data Fig. 4) responsible for the generation 
of the ejecta clouds (Methods subsection “Dust production from 
impacts’). This observed anisotropy is in contrast to the roughly iso- 
tropic ejecta clouds engulfing the Galilean satellites, where the vast 
gravitational influence of Jupiter is efficiently randomizing the orbital 
elements of the bombarding interplanetary dust particles’*. The aniso- 
tropic ejecta production is consistent with existing models of the inter- 
planetary dust distributions near the Earth that combine in situ dust 
measurements, visible and infrared observations of the zodiacal cloud, 
as well as ground-based visual and radar observations of meteors in the 
atmosphere’*’”, The dust production on the lunar surface is domi- 
nated by particles of cometary origin, as opposed to slower asteroidal 
dust particles, which follow near-circular orbits as they migrate 
towards the Sun, owing to Poynting—Robertson drag'*. Meteoroids 
that are of asteroidal origin would be able to sustain only a much 
weaker, more azimuthally symmetric ejecta cloud, contrary to LDEX 
observations. 

In addition to bombardment by interplanetary dust, the exposure of 
airless surfaces to ultraviolet radiation and solar wind plasma flow 
could result in the lofting of small dust particles, owing to electrostatic 
charging and subsequent mobilization’. The charging processes are 
expected to be most efficient over the terminators, where strong loca- 
lized electric fields could exist over the boundaries of lit and dark 
regions. High-altitude horizon glow observations near the lunar ter- 
minator suggested a population of grains, characteristically of radius 
0.1 pm, with a density of n ~ 10* m ° at an altitude of h = 10 km, 
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function of LT at three different altitude bins showing a persistent enhancement 
canted towards the Sun away from the direction of the orbital motion of the 
Earth-Moon system. Error bars were calculated by propagating the V/N error 
through the density calculation, where N is the number of detected dust 
impacts. 
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Figure 4 | LDEX current measurements. The accumulated charge collected 
by LDEX in dt = 0.1 s intervals (Jaust), averaged over the sunrise terminator 
between 5:30 and 6:30 LT. The coloured lines show the predicted value of Jaust 
based on the impacts of small particles alone using the upper limits of the dust 
densities estimated by remote sensing visible**”° and ultraviolet’ observations. 
Error bars show 1a on log/ gust as the current measurements are log-normally 
distributed. Because Jys¢ ~ 10° electrons per second is two orders of magnitude 
lower than the Apollo estimates near an altitude of 10 km and exhibits no 
altitude dependence, LDEX measurements show no evidence for the existence 
of the suggested relatively dense clouds of 0.1-j1m-sized dust particles. LRO is 
the Lunar Reconnaissance Orbiter. 


increasing towards the surface to n = 5 X 10° m *. Follow-up 
observations*” indicated a drastically lower upper-limit of the lofted 
dust densities. At an altitude of 10 km, our dust current measurements 
show an upper limit for the density of particles of radius 0.1 um that is 
approximately two orders of magnitude below the Apollo estimates”. 
However, the LDEX dust current measurements (Methods subsection 
‘The LDEX instrument’) of Jays: 10° electrons per second (Fig. 4) 
remained independent of altitude and, hence, gave no indication of 
the relatively dense cloud of 0.1-1m-sized dust that was inferred from 
the Apollo observations over the lunar terminators. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


The LDEX instrument. The Lunar Dust Experiment (LDEX) is an impact ion- 
ization dust detector, which measures both the positive and negative charges of the 
plasma cloud generated when a dust particle strikes its target””. The amplitude and 
shape of the waveforms (signal versus time) recorded from each impact are used to 
estimate the mass of the dust particles. The instrument has a total sensitive area of 
0.01 m’, which gradually decreases to zero for particles arriving from outside its 
dust field-of-view of +68° off from the normal direction. LDEX is sensitive to 
ultraviolet; hence, its operations, in general, excluded the Sun in its optical field-of- 
view of +90°. 

Particles below the single impact detection threshold generate a plasma cloud in 

the same way as larger impacts, but without triggering a full waveform capture. 
Their collective signal is integrated independently of the impacts of large particles. 
In addition to small dust particles, the integrated ion signal is also sensitive to a 
number of possible background contributions, most importantly ultraviolet 
photons scattering into the ion detector, generating photoelectrons. To identify 
the background contributions in the collective signal, the acceleration potential 
between the target and ion collector is intermittently switched from its nominal 
—200 V to +30 V, making the instrument ‘blind’ to dust. The contributors to the 
current in nominal mode (Jy) are ions from dust impacts, photoelectrons and 
low- and high-energy ions. In switched mode (Js) the contributors are photo- 
electrons and high-energy ions. High energy in this case indicates >30 eV, as 
these ions can reach the microchannel plate even in switched mode. Hence, 
the difference Jy — Js represents the collective signal of small dust particles 
and low-energy ions only. Each dust particle with a radius of 0.1 pm and impact 
speed of 1.6kms°’ is expected to generate” Q; ~ 100e. Their collective current is 
Jaust ~ AvnQ; where A is the detector sensitive area, v is the speed of dust particle 
relative to the spacecraft, and n is the density of the small particles. Hence, attrib- 
uting Jy — Js to small dust particles alone allows us to set an upper limit for n. 
Alternatively, estimates for n from independent observations can be used to 
predict Jaust: which we can then compare to our measurements. The low-energy 
ion contribution may be due to back-scattered solar wind protons”, energetic 
neutral atoms”*”® and the lunar ionosphere”’. Any contribution of low-energy ions 
to Jy — Js would further reduce our estimate of n. 
Dust ejecta clouds. We compare the steady state, spherically symmetric model of 
a dust cloud’’ to the Lt-averaged LDEX observations. The phase space density of 
dust above the surface based on a model for an impact-generated ejecta cloud can 
be written as'*’* 


Nt _ fulu(r) fyWv,0)) 
8x2Rr vu(v)" sin 0 cos W(v,0) 


n(v,0,9; r) 


(1) 


Here, the variables v, 0, @ denote the velocity vector of dust grains at a radial 
distance r from the Moon (lunar radius R = 1,737 km). The distance r is regarded 
as a parameter of the distribution, nota variable. Further, 0 is the angle between the 
velocity vector and the radial direction and ¢ is the velocity azimuth angle (anti- 
clockwise around the radius vector). The distribution does not depend explicitly 
on ¢ for a spherically symmetric cloud. We retain the azimuthal dependence to 
perform averages over quantities that do depend on @. N™ is the total rate of 
grains produced on the surface. We denote by u the starting speed of ejecta on 
the surface and by w the ejection cone angle measured from the surface normal. 
Using the conservation of energy and angular momentum of the two-body prob- 
lem we have = (v, 0) and u = u(y). For the distribution of starting velocities, f,,, 
we use a power law with exponent py (equivalent to y + 1 in other customary 
notation'*), normalized to unity in the range ue[uUp, 00}: 


iat 
fulu)= f Ip uk 
Ug 


For the ejecta cone angles, we use a uniform distribution, fr normalized to unity in 
the range We[0,Wo]: 
sin 
f= 


— cos Wo 


The mass distribution of the grains is uncorrelated with the velocity and is 
described as a power law that is normalized to the total rate of mass production 
in the range mE|Mmin,Mmax| (refs 12 and 18). The generalized version of equation 
(1) becomes 


Mt 1% any full) fil .0)) 


8n2Rr mi — mi vu(v)’ sin 0 cos W(v,0) 


n(m,v.0,9; 7) (2) 
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where M* denotes the total mass production rate that is related to N* as 


+ ate min max 
N M Oo mi oO _ m} a 
max min 


The exponent is expected’*”® to be in the range 0.5 = & = 1. Equation (2) gives the 
number of particles found at distance r from the centre of the moon in the phase 
space volume element d°vd*rdm. 

When a grain of mass m hits the dust detector at a velocity v it produces an 
impact charge” 


q=cmv! (3) 


The combination of equations (2) and (3) can be used to estimate the distribution 
of charges to be recorded by LDEX in the following manner. 

Typically the LADEE spacecraft followed a nearly circular orbit around the 
moon with speed ¥,. relative to the surface. The boresight of LDEX pointed in 
the direction of spacecraft motion (apex) so that the detector encountered dust 
grains of velocity v at a relative velocity v,. — v. The number of grains AN with 
velocity in d°’v and mass in dm, that can reach the detector during time At, is given 
by the number of such grains found in a cylindrical volume spanned by the 
detector surface A and the relative velocity vector (Extended Data Fig. 1) 


AN = Atd’ vdmA(@) cos @ Oy( cos w) n(m,v,0,9; 1)|V—Vsc| (4) 


where c is the angle between the boresight and the relative velocity vector. The 
Heaviside function, O}(cosc), guarantees that we count only grains that can enter 
the detector. With A() we account for the fact that the effective detector area of 
LDEX depends on the angle «. The effective area is maximal for w = 0, dropping to 
zero for wo = 68 degrees (ref. 22). We evaluate cosw in terms of the spacecraft and 
dust velocities, as well as the angles @, 0 by noting that for a circular spacecraft orbit 
Vse — v sin 0 cos 
cos = jes Oe (5) 
|v —Vec| 

Dividing (4) by At we obtain the differential rate dy of particles that impact the 
detector 


dy =dvv’d0d¢ sin 0dmA(@)(V5- —v sin 0 cos $)On(Vs- — v sin 0 cos ) 
n(m,v,0,9; 7) 


where A(q) is expressed with equation (5) as a function of v, 0, $. We re-arrange 
the right-hand side of (6) into a mass and a velocity distribution as 


dy =dvd0dg dm pn(m)p(v.0.9; rWo-tsto) 


where 
l-a (ew 
Pm(rn)=M* mg "O(n — tinin)OvC tnx") (7) 

and 

A@) , 

P(V,0,9; 7. Wo.fUo) = an2Rp (Vs —v sin 0 cos ¢) 
Or(Vs- — v sin 0 cos pve DWWUD) WV.) 
u(v)” cos W(v,0) 

We define 


Tt 2n 
polv)= {eo | dpplv:0.d: tN e.tui) 
0 0 


Using equation (3) we can then 7m 7 express the model prediction for the distri- 
bution of charges detected by LDEX as 


fee) 


P(Q= | dyd(q— mev?) = | dmpm(m) | dvp,(v)d(q— mcv?) 
0 0 
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pv(v) ( | ) 
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Inserting equation (7) and sorting terms gives 
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where the boundaries 


1/B 1/B 
Vmin(q) = (2 ) > Vmax = (4 ; ) 


follow from the normalization of the mass distribution, equation (7). 
Evaluating p,(v) shows that the integral in equation (8) depends only weakly on 
the charge q via the boundaries. Quantitatively, 


Vmax (4) 
f dvp,(v)v¥F 


Vmin(q) 


0.13< <0.23 


a 


ii dvp, (vv 
0 


when changing q from 0.1 fC to 1,000 fC. Therefore, the distribution p,(q) is 
dominated by far by the power law, so that we expect to see 


PalQ)<4q 


in the data. Hence, measuring the exponent of the impact charge distribution 
yields the exponent of the mass distribution of the particles recorded by LDEX. 
Dust production from impacts. Extended Data Fig. 3 and Extended Data Table 1 
were generated assuming that the LT-averaged ejecta cloud is spherically symmetric 
to determine the bulk properties of the cloud and allow for direct comparison with 
previous studies'**°. Here we address the anisotropic nature of the dust influx to the 
lunar surface. To this end we replace the single global dust mass production M* with 
an LT- and time (f)-dependent function of mass production per unit surface area 
M* (xr, t). 

The mass flux of bombarding interplanetary dust particles with mass m and 
velocity v is a function of both the position on the lunar surface and time: F(m, v, 
Lt, t). A single dust particle striking a pure silica surface generates a large number of 
ejecta particles with a total mass m* = mY(m, v), where the yield, Y ~ mi, is 
determined on the basis of laboratory experiments*’. The mass flux of impactors is 
dominated by particles with characteristic mass mp ~ 10° kg (about 100 pm in 
radius)**. Our detected impacts are dominated by ejecta particles generated along the 
ground track of the spacecraft that followed a nearly equatorial orbit. Hence, it is 
convenient to track the position on the lunar surface in LT, with Lr = 0, 6, 12, 18 
marking midnight, the dawn terminator, the sub-solar point and the dusk termin- 
ator, respectively. The mass production rate per surface area as function of LT is 
found by integrating the product of the interplanetary dust flux F and the yield Y 
around the lunar equator 


—(1+2) 


M* (it,t)= { Fonvsrv(ns)dmay (9) 


We evaluated equation (9) using both NASA’s MEM" and ESA’s IMEM”’ models, 
which agree well near one astronomical unit. The flux F and the predicted mass 
production rates M * (xr, £) are shown in Extended Data Fig. 4, and are consistent 
with the asymmetric dust ejecta cloud observed by LDEX. We note that the 
meteoroid population still remains one of the most uncertain space environment 
components”. 

Data availability. All LDEX data are available through NASA’s Planetary Data 
System (http://sbn.psi.edu/pds/resource/ldex.html). 

Code availability. The code used for calculating the flux of interplanetary dust 
particles reaching the lunar surfaces is available upon request from NASA (http:// 
www.nasa.gov/offices/meo/software). 
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Extended Data Figure 1 | Detection geometry. A particle of velocity vis 
recorded by a detector of sensitive area A. The surface normal of the detector 
area points along the velocity vector of the spacecraft v,.. The particle enters the 
instrument with an angle w measured between the instrument boresight and 
the relative velocity vector of the particle v,. — v. 
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Extended Data Figure 2 | Systematic approximation error and its 
dependence on ejection parameters. a, The calculated density for a standard 
set of parameters listed in Extended Data Table 1 for the model ejecta cloud’’ as 
function of altitude (black line) normalized to the production rate N +. The 
density is recalculated using n = y/(Av,.) (red line), the approach taken in this 
paper to infer the dust density from the measured impact rates y, indicating an 


underestimate of <20% for altitudes below 100 km. b, Contour plot of the 
ratio of the ‘true’ model density over the recalculated density at the altitude 

h = 50 km, as a function of the opening cone angle of the ejecta plume Wp and 
the exponent of the power-law initial-speed distribution 1, appropriately 
setting the minimum speed ug, while keeping the maximum speed constant at 
2Vescape Maintaining a constant total kinetic energy of the ejecta particles. 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


a 10.00 
4.96 
= 2.46 5. 
°o g 
= 3 
> x 
2 1.22 § 
3) 6 
a 
0.60 
0.30 
0 20 40 60 80 100 120 
Altitude [km] 
b oa 
gz 
y) 
rn) 
& 
= 
© 
2 
& 
=} 
= 
=} 
o) 
0 20 40 60 80 100 120 
Altitude [km] 
Cc 
0.06 
S 0.04 
0.02 
0.00 
0 100 200 300 400 500 600 
u [m/s] 
Extended Data Figure 3 | Comparison of observed and modelled cloud normalized vertical velocity distribution f(u) calculated from n(h) using energy 


properties. a, The dust density n(h) of the lunar ejecta cloud as function of conservation. The continuous line shows f(u) « u** *°! matched to the data 
altitude and size (colour scale). The continuous black line shows the model at u= 400 ms ! (altitude h ~ 50 km). Error bars were calculated by 
prediction” using the best-fit parameters listed in Extended Data Table 1. propagating the V/N error through the various calculations, where N is the 

b, The cumulative dust mass in the lunar exosphere. The continuous blue line number of detected dust impacts. 

shows the ejecta model prediction (Extended Data Table 1). ¢, The initial 
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Extended Data Figure 4 | Modelled flux and mass production in the lunar —_ monthly averages). b, The mass production rate, equation (9), calculated using 
equatorial plane. a, The calculated flux of interplanetary dust particles Fn, | amodel for the spatial and velocity distributions of interplanetary dust particles 
reaching the lunar equatorial region as a function of LT and t (colour coded for _ near the Earth’®, consistent with the observed asymmetric dust cloud. 
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Extended Data Table 1 | Parameters of the theoretical ejecta cloud model’? for the Moon 
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Parameter 


Definition 


Value 


mass distribution 


Nt(> m) x m° 


a = 0.91 + 0.003 


smallest mass* hme S-10 lee 
largest mass** Mmax 10-8 kg 
speed distribution fale) xu F peH3Acet 0.1 
minimum speed uo 130 m/s 
maximum speed Umax = 2° Vescape 4.8 km/s 
impactor speed times 20 km/s 
ratio of ejecta/impactor kinetic energy Ke/K; 20% 
ratio of ejecta/impactor mass ¥ 1000 
initial velocity maximum cone angle Wo 30° 


"radius Gite. = O53 gmt Garey = 100 pom 


These parameters form a consistent set, and are not independent of each other*°. 
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Multiple complexation of CO and related ligands 


to a main-group element 
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Alfredo Vargas” & Qing Ye! 


The ability of an atom or molecular fragment to bind multiple 
carbon monoxide (CO) molecules to form multicarbonyl adducts 
is a fundamental trait of transition metals. Transition-metal car- 
bonyl complexes are vital to industry, appear naturally in the active 
sites of a number of enzymes (such as hydrogenases), are promising 
therapeutic agents’, and have even been observed in interstellar 
dust clouds’. Despite the wealth of established transition-metal 
multicarbonyl complexes’, no elements outside groups 4 to 12 of 
the periodic table have yet been shown to react directly with two or 
more CO units to form stable multicarbonyl adducts. Here we 
present the synthesis of a borylene dicarbonyl complex, the first 
multicarbonyl complex of a main-group element prepared using 
CO. The compound is additionally stable towards ambient air and 
moisture. The synthetic strategy used—liberation of a borylene 
ligand from a transition metal using donor ligands—is broadly 
applicable, leading to a number of unprecedented monovalent 
boron species with different Lewis basic groups. The similarity of 
these compounds to conventional transition-metal carbonyl com- 
plexes is demonstrated by photolytic liberation of CO and sub- 
sequent intramolecular carbon-carbon bond activation. 

The ubiquity and utility of transition-metal carbonyl complexes can 
be attributed partly to CO’s combination of high-lying filled and low- 
lying vacant orbitals, which allows it to take advantage of the versatile 
d-orbital arrangement of transition metals and thus engage in synergic 
o bonding and x backbonding. In contrast, surprisingly few non-d- 
block elements form monoadducts with CO; they include a handful of 
complexes of uranium*, phosphorus’, boranes (BR;)°, polyboranes’, 
and certain carbenes*. Only two multicarbonyls of the elements of the s 
and p blocks have been crystallographically characterized”"®: carbon 
suboxide (OCCCO) and the cation in the salt [OCNCO]* [Sb3Fi6]. 
However, neither can be prepared by carbonylation of the central atom 
with CO itself. 

The relatively recent realization that low-valent main-group species 
can act as synthetic mimics of transition-metal species has led to a 
flourishing area of chemistry centred around their synthesis and use as 
reagents for small-molecule activation''"*. Double Lewis-base adducts 
of main-group elements have recently attracted theoretical interest in the 
literature with the development of the concept of ‘carbones’, twofold 
Lewis adducts of a single carbon atom of the form L—C<-L (where L 
is a Lewis base)'*”°. In a similar manner, the synthesis of twofold base 
adducts of monovalent boron extended this concept to elements beyond 
carbon*'™, and more recently analogous ‘-ones’ of heavier group 14 
elements have also been realized’*. This rethinking of the bonding in 
main-group compounds in terms of bonding between a Lewis donor and 
a Lewis acceptor has recently spurred heated debate in the literature***. 

While earlier reports on carbon suboxide and its cationic nitrogen 
analogue presume a cumulenic (O = C = E = C = O) (where Eis C 
or N) structure*’®, Frenking and co-workers have suggested that 
these compounds might be better considered as Lewis adducts 
(OC-—E<CO). In contrast, they predicted that the hypothetical 


borylene dicarbonyl HB(CO), would contain much more E>CO 2 
backbonding than in the carbon and nitrogen dicarbonyls (when E is 
boron, m backbonding is 33.8% of the total bonding energy, whereas it 
is 5.2% and 10.0% for nitrogen and carbon, respectively)”, making the 
boron analogue much more akin to transition-metal bis(CO) counter- 
parts. Frenking and co-workers” additionally made the prediction that 
“substituted homologues [of HB(CO),] with bulky groups at boron 
which protect the boron atom from electrophilic attack might also be 
stable enough to become isolated.” Here we prove this prediction to be 
correct by the synthesis of a remarkably air-stable, neutral and crys- 
talline borylene dicarbonyl, related monovalent boron compounds, 
and the reactivity thereof. 

Stirring a CO-saturated benzene solution of the terminal borylene”® 
complex [(OC);Mo(BTp)] (compound 1b, where Tp is 2,6-di(2,4,6- 
triisopropylphenyl)) at reflux for 18 h resulted in a colour change from 
yellow to blue. After removal of molybdenum hexacarbonyl by frac- 
tional crystallization from toluene, blue crystals were obtained (com- 
pound 2, see Fig. 1; note that the monovalent boron compounds 2-4 
and 8 are depicted in both conventional covalent forms and Lewis 
donor/acceptor forms in this figure, according to the conflicting ideas 
set out in the aforementioned debate on the existence of dative 
bonds in main-group compounds**”*). This compound showed a 
strongly upfield-shifted '"B nuclear magnetic resonance (NMR) signal 
(6 -32.2) compared to that of the precursor 1b (6 155) and signals 
corresponding to the Tp fragment in the 'H NMR spectrum. The 
two strong bands in the carbonyl region of its infrared spectrum (at 
1,942 cm | and 2,060 cm 1) are also reminiscent of the symmetrical and 
unsymmetrical stretches of transition-metal cis-dicarbonyl complexes. 
The crystallographically determined molecular structure of 2 (Fig. 2a) 
confirmed the dicarbonylation of the boron atom and the presence of 
a borylene dicarbonyl, with two essentially linear B-C-O axes (the 
B-C-O angles are 173.9(1)° and 174.2(2)°). The C-O distances in 
2 (1.150(2) A and 1.152(2) A) are within the range of C=O bonds 
in crystallographically characterized ketenes’ (1.13-1.18 A), indicating 
a lower bond order of the CO units than in free carbon monoxide anda 
substantial amount of m-electron delocalization across the five-atom 
OCBCO unit. The B-C bonds of 2 (1.473(3) A and 1.475(3) A) are 
furthermore distinct from those of borane carbonyls (R;B<—-CO; 
typical B-C distances’ are 1.6-1.7 A), which feature markedly longer 
B-C distances indicative of single bonds. A related, structurally char- 
acterized borylene monocarbony] has recently been reported”, con- 
taining a much more localized B~C=O unit (the C-O bond length is 
1.091(3) A; that for B-C is 1.529(5) A), making it thus more remin- 
iscent of the aforementioned family of borane carbonyl adducts. The 
large structural differences between 2 and borane carbonyl adducts 
(including the compound of ref. 24) suggests that 2 is the first com- 
pound in which a boron atom binds CO with strong nm backbonding 
from the boron to the CO, as is observed in traditional transition-metal 
carbonyl complexes, thus revealing boron’s ability in this case to act as 
a transition-metal mimic. As a test of its stability to air and moisture, 
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Figure 1 | Synthesis and reactivity of bis(CO), bis(isonitrile) and mixed 
adducts of boron. Note that in solution, compound 4 adopts a borylene 
isocyanide form akin to compound 8. Tp is 2,6-di(2,4,6-triisopropylphenyl)- 


a solid sample of 2 was kept in an ambient atmosphere for five days, 
after which time its blue colour remained unchanged and a ‘H NMR 
spectrum of this sample indicated approximately 30% decomposition. 
Treatment of terminal borylene complex [(OC);Cr(BTp)] (1a) with 
2,6-diisopropylphenylisocyanide (DippNC) at room temperature led 
to a similarly high-field signal in its 'B NMR spectrum (6 -28.4) and 
the isolation of blue crystals. However, 'H and ‘°C NMR spectroscopy 
and single-crystal X-ray crystallography (Fig. 2a) indicated that this 
compound (3, Fig. 1) was the product of monocarbonylation of the 
boron atom (presumably via migration from the M(CO); fragment), 
with concomitant addition of one isocyanide. Compound 3, while 
sharing a similar L—+B(R)(CO) connectivity to the aforementioned 
borylene monocarbonyl of ref. 24, differs dramatically in terms of 
delocalization of the boron atom with the CO unit, having much longer 
C-O bonds (1.161(3) A) and much shorter B—-C bonds (1.464(4) A). 
The slightly bent isocyanide CNC unit in 3 (162.5(2)°) also indicates 
weak delocalization over the BCN unit. The ability of 3 to act like a 
‘masked’ free borylene was indicated by its behaviour under photolytic 
conditions for four days, after which colourless crystals (5, Fig. 1) were 
isolated. Compound 5, a conventional isocyanide-borane adduct, is 
the product of decarbonylation and formal borylene insertion into a 
carbon-carbon bond of the terphenyl substituent Tp. Such C-C bond 
insertions are hallmarks of reactive (free) borylene chemistry”®, the for- 
mation of which can be envisaged via photolytic decarbonylation of 5. 
The ability of 3 to photolytically liberate CO and break strong element- 
element bonds further reinforces its comparison with transition-metal 
carbonyl complexes, for which this is a key reactivity motif. 
Alternatively, addition of three molar equivalents of methylisocya- 
nide to terminal borylene complex 1a led to the isolation of red crystals 
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phenyl. Dur is 2,3,5,6-tetramethylphenyl. Dipp is 2,6-diisopropylphenyl. RT, 
room temperature. 


(4, Fig. 1) that showed a ''B NMR signal (6 -18.8) to slightly lower field 
than those of 2 and 3. The 'H NMR spectrum of 4 indicated 
the existence of two equivalent methylisocyanide units in the com- 
pound; however, in the solid state this equivalence does not 
persist. The crystallographically determined molecular structure of 
4 (Fig. 2a) shows the compound to be effectively an isocyanide- 
stabilized boraketenimine, wherein one DippNC group adopts a 
linear, non-BC-delocalized arrangement (C-N-C angle is 171.2(2)°), 
while the other adopts a bent, imine-like geometry (C-N-C 
angle is 126.5(1)°). In a similar manner, addition of three molar 
equivalents of DippNC to the terminal iron borylene complex 
[(OC)3(Me3P)Fe(BDur)] (6; where Dur is 2,3,5,6-tetramethylphenyl) 
led to isolation of an analytically pure red oil (8, Fig. 1). Multinuclear 
NMR spectroscopy and single-crystal X-ray crystallography of 8 
(Fig. 2a) indicated the presence of a borylene bis(isonitrile) similar 
to 4, but in which the isonitriles are more geometrically equivalent. 
When borylene 6 was treated with only two molar equivalents of 
DippNC, a 73% yield of an iron-containing complex (7, Fig. 1) was 
obtained that showed extensive *'P-'°C and *'P-'H coupling in 
the °C and 'H NMR spectra, respectively, and a 31D NMR signal 
at 6 11.55. A ''B NMR signal of 7 was found at 6 -37.5, to even 
higher field than those of the formally monovalent boron com- 
pounds 2-4 and 8. The crystallographically determined molecular 
structure of 7 showed this compound to be effectively a complex of 
8 with the zero-valent [Fe(CO)3(PMe;)] fragment, bound as a base- 
stabilized boraketenimine in a m fashion through one B=C bond. 
The CNC unit of the Fe-bound DippNC group of 7 is bent (at 
128.1(2)°) and imine-like. The presumed role of complex 7 as an 
intermediate in the borylene liberation that leads to 8 was confirmed 
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Figure 2 | Crystallographic and computational results on the molecules 
prepared herein. a, Molecular structures of 2-5, 7 and 8 as derived from 
single-crystal X-ray crystallography. Thermal ellipsoids in the structures 
represent the 50% probability level. For clarity, hydrogen atoms and some 
thermal ellipsoids have been removed. Selected bond lengths and angles are 
given as follows. For 2: B-Cl 1. 475(3) A, C1-O1 1.152(2) A, B- C21, 473(3) A, 
C2-O2 1.150(2) A and B-C3 1.571(3) A. For 3: B-Cl 1.466(3) A, C1-O1 
1.159(2) A, B-C2 1.485(3) A, C2-N 1.177(2) A and B-C3 1.574(2) A. For 

4: B-Cl 1.504(2) A, C1-N1 1.153(2) A, B-C3 1.450(2) A, C3-N2 1.210(2) A, 
B-C5 1.569(2) A, C1-N1-C2 171.2(2)° and C3-N2-C4 126.5(1)°. For 5: B-Cl 


by the addition of a further equivalent of DippNC, furnishing 8 
quantitatively, as judged by NMR spectroscopy. 

Kohn-Sham density functional theory calculations at the B3LYP/ 
6-311G* level were also performed on 2-4 (see Supplementary Figs 28 
and 29). Although the natural charges at the boron atoms of 2-4 are all 
negative, they become more neutral as the number of carbonyls is 
decreased (2: -0.21; 3: -0.13; 4: -0.04, Fig. 2b). However, the calculated 
distances between the boron atoms and the sp-hybridized carbon atoms 
of all three compounds do not vary much (1.474-1.487 A), and nor do 
the corresponding Wiberg bond indices, which are without exception 
greater than unity (1.23-1.30) and reflect the extensive boron-to-ligand 


1.584(2) A and C1-N 1.146(2) A. For 7: B-Fe 2.265(3) A, C1-Fe 2.046(3) A, 
B-Cl 1.546(4) A, C1-N1 1.247(3) A, B-C3 1.518(4) A, C3-N2 1.154(3) A, 
B-C1-NI1 141.7(3)°, and C1-N1-C2 128.1(2)°. For 8: B-Cl 1.472(3) A, 
C1-N1 1.187(2) A, B-C3 1.469(3) A, C3-N2 1.186(2) A, B-C5 1.590(3) A, 
C1-N1-C2 149.9(2)° and C3-N2-C4 145.2(2)°. b, Computed geometries of 
2-4, Interatomic distances (in A) are given in black, Wiberg bond indices 

in parentheses, and natural charges in red. c, Computed HOMOs of 
compounds 2-4, where red and blue indicate different phases of the 
wavefunction. 


backbonding in these molecules. Analysis of the frontier orbitals of 2-4 
(Fig. 2c) indicates in all cases that the highest occupied molecular 
orbital (HOMO) level consists of three-centre m bonding interactions 
between the boron atom and the attached (carbonyl and/or isocyanide) 
groups. This backbonding from the boron atom’s filled p orbital into 
the 1* orbitals of the ‘ligands’ is reminiscent of the well known metal- 
to-carbon nt backbonding seen in transition-metal carbonyl complexes. 
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An enigmatic plant-eating theropod from the Late 


Jurassic period of Chile 


Fernando E. Novas’, Leonardo Salgado’*"*, Manuel Suarez”, Federico L. Agnolin®®, Martin D. Ezcurra’, Nicolas R. Chimento’, 
Rita de la Cruz®, Marcelo P. Isasi**, Alexander O. Vargas” & David Rubilar-Rogers”"'° 


Theropod dinosaurs were the dominant predators in most Mesozoic 
era terrestrial ecosystems’. Early theropod evolution is currently 
interpreted as the diversification of various carnivorous and curs- 
orial taxa, whereas the acquisition of herbivorism, together with the 
secondary loss of cursorial adaptations, occurred much later among 
advanced coelurosaurian theropods'”. A new, bizarre herbivorous 
basal tetanuran from the Upper Jurassic of Chile challenges this 
conception. The new dinosaur was discovered at Aysén, a fossil 
locality in the Upper Jurassic Toqui Formation of southern Chile 
(General Carrera Lake)**. The site yielded abundant and exquisitely 
preserved three-dimensional skeletons of small archosaurs. Several 
articulated individuals of Chilesaurus at different ontogenetic stages 
have been collected, as well as less abundant basal crocodyliforms, 
and fragmentary remains of sauropod dinosaurs (diplodocids and 
titanosaurians). 


Theropoda Marsh, 1881 
Tetanurae Gauthier, 1986 
Chilesaurus diegosuarezi gen. et sp. nov. 


Etymology. In reference to Chile, and honoring Diego Suarez, who at 
the age of 7, discovered the first bone remains in the Toqui Formation. 
Holotype. Servicio Nacional de Geologia y Mineria, Chile (SNGM)- 
1935 consists of a nearly complete, articulated skeleton, approximately 
1.6m long (Fig. 1, Supplementary Information and Extended Data 
Fig. 1). Holotype specimen was skeletally immature at the time of its 
death, as evidenced by the incomplete fusion of neurocentral sutures. 
This ontogenetic inference agrees with the size of the holotype, which 
represents 50% the length of the larger specimen SNGM- 1888 (ref. 3). 
Paratypes. Postcranial skeletons of four individuals, corresponding to 
different ontogenetic stages, ranging approximately from 1.2 to 3.2m 
in total length (Extended Data Table 1). Several specimens referred to 
as indeterminate theropods and tetanurans previously’ are here 
referred to as Chilesaurus diegosuarezi. 

Locality and horizon. Central Patagonian Cordillera, Aysén (Chile; 
approximately 46° S); Toqui Formation**, Tithonian, latest Jurassic. 
Diagnosis. Chilesaurus differs from other dinosaurs in the following 
combination of autapomorphies: premaxilla short and deep, with 
prominent plate-like postnarial process; teeth leaf-shaped, being finely 
denticulate only on the crown apex of erupting teeth; coracoid sub- 
quadrangular in side view and with transversely thick margins; manual 
digit II with short pre-ungual phalanges; manual digit III atrophied; 
iliac blade with posterodorsal prominence; ischiadic peduncle of ilium 
robust; supracetabular crest absent; pubis fully retroverted; pubic 
shaft rod-like and distally unexpanded; femoral mediodistal crest 
absent; tibia without fibular crest. In addition, Chilesaurus shows the 


following unique combination of characters: dentary deeper anteriorly 
than posteriorly; cervicals with septate and paired pleurocoels; pubic 
apron transversely narrow; ischia connected through a proximodistally 
extended medial lamina (‘ischial apron’); femoral greater trochanter 
anteroposteriorly expanded, astragalar ascending process lower than 
astragalar body; calcaneum subtriangular in distal view; metatarsal I 
robust, elongate, and proximally compressed transversely; metatarsal II 
transversely wider than the other metatarsals; pedal digit I large. 

Isolated skull material (including premaxilla, maxillae, frontals, 
postorbital, squamosal, basicranium, ectopterygoid and dentary; 
Extended Data Fig. 2), suggests a proportionally small head for 
Chilesaurus. The premaxilla is short and deep, with a rugose rostral 
margin that suggests a ramphotheca (Fig. 2). Frontals are elongate and 
narrow and participate extensively in the orbital margin. The basi- 
sphenoidal recess is deep. The dentary is short and deep, with a down- 
turned symphyseal region but a straight alveolar margin. Dentary teeth 
are tall, leaf-shaped, and procumbent, with small serrations restricted 
to the crown apex (Fig. 2). 

Cervical vertebrae are long and low, forming a slender neck. 
Cervical and anterior dorsal vertebrae possess a pair of septate pleur- 
ocoels, which are absent posterior to the pectoral region (Fig. 1g). 
‘Pectoral’ vertebrae bear prominent hypapophyses. 

The scapular blade (Extended Data Fig. 3) is elongate and slightly 
anteroposteriorly expanded distally, as in basal averostrans’. The cor- 
acoid is subquadrangular and lacks theropod characteristics such as the 
posteroventral process and biceps tuberosity’. It is notably thick trans- 
versely, contrasting with the delicate anterodorsal and dorsal margins 
of most dinosaurs. The limb bones are stout, as in sauropodomorphs, 
and forelimb length is 56% that of hind limbs. The humerus is prox- 
imodistally short and transversely wide (Extended Data Fig. 3). The 
single proximal carpal is large, with a transversely convex proximal 
articular surface. Metacarpals I-III are present, but only manual digits I 
and II are well developed (Fig. 1d-f). Metacarpal I is stout, and phalanx 
1-I is short and strongly twisted along its main axis, as in basal saur- 
opodomorphs’. The ungual of digit I is shorter than metacarpal II and 
less curved than most basal tetanurans*’”. Metacarpal II is the longest, 
and its digit presents strongly shortened pre-ungual phalanges, as in 
some ceratosaurians*. Metacarpal II is much more slender than in 
basal theropods, and its digit comprises a single minute phalanx. 

The ilium is dolichoiliac, typical for Theropoda' (Fig. 1b). The pubic 
pedicle is elongate (as in sauropods, ornithischians and therizino- 
saurs*®), and the ischiadic peduncle is bulbous, as in ornithischians 
and alvarezsaurid coelurosaurs”®. A prominent supratrochanteric pro- 
cess is present on the posterodorsal corner of the ilium, similar to those 
of sauropods, therizinosaurs, paravians, and some ornithischians''’. 
The acetabular roof is transversely narrow and a supracetabular crest 
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Figure 1 | Skeletal anatomy of Chilesaurus diegosuarezi gen. et sp. nov. 

a, Reconstructed skeleton (SNGM-1935). b, Fourth cervical vertebra (SNGM- 
1935) in right lateral view, with a close-up of tabicated anterior pleurocoel. 

c, d, Composite reconstruction of right hand (carpals, metacarpals, and non- 
ungual phalanges of digits I and II are from specimen SNGM-1935; ungual 
phalanges I and II are from specimen SNGM-1937; metacarpal III is from 
specimen SNGM-1887) in dorsal (c) and medial (d) views. e, Pelvic girdle 
(SNGM-1936) in right lateral view. f, Articulated ischia (SNGM-1936) in 


is absent, as occurs in derived coelurosaurs and ornithopods’””. 


The pubis (Fig. 1b, c) closely resembles that of basal ornithischians, 
therizinosaurs and dromaeosaurid paravians in being fully retroverted, 
with a reduced proximal end bearing a posteriorly open obturator 
notch’. The pubic apron is transversely narrow and the pubis has a 
rod-like shaft. It is distally unexpanded with a rounded contour, shar- 
ply contrasting with the prominent distal ‘foot’ of other theropods’. 
The ischium (Fig. 1a, b) is proximally expanded and lacks an obturator 
process. Notably, both ischia are connected through a proximodistally 
extended medial lamina (ischial apron), a feature reported in some 
megalosauroids™. 

The femur is robust (Extended Data Fig. 4). The greater trochanter 
is anteroposteriorly expanded, similar to coelurosaurs. The anterior 
trochanter is wing-like and proximally projected, and the fourth tro- 
chanter is semicircular. The distal third of the femur resembles saur- 
opodomorphs in that it lacks distinctive theropod features, such as an 
anteromedial elliptical muscle scar, a mediodistal crest and its assoc- 
iated medial adductor fossa. The cnemial crest of tibia (Extended Data 
Fig. 4) is rounded in lateral view, as in basal sauropodomorphs and 
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posterior view. g, Articulated pubes (SNGM-1936) in anterior view. 

h, Proximal tarsals (SNGM-1888) in distal view. i, j, Left astragalus (SNGM- 
1936) in proximal (1) and anterior (m) views. k-m, Left pes (SNGM-1937) in 
dorsal (kk), medial (1) and proximal (m) views. ap, ascending process; ast, 
astragalus; b, basin; c, carpal; calc, calcaneum; ia, ischiadic apron; is, ischium; 
mtc, metacarpal; p, pubis; poz, postzygapophysis; pps, posterior pleurocoel; prz, 
prezygapophysis; ps, pubic symphysis; I, II, IIL, IV, digits first to fourth. 


basal ornithischians, and differs from the subtriangular crest 
present in most basal theropods’’’. As in sauropodomorphs, and 
unlike most theropods, the proximal end of the tibia lacks a fibular 
crest’. Like theropods, the distal end of the tibia is anteroposteriorly 
compressed with a laterally extending malleolus, more transversely 
expanded than in coelophysoids, but less than in tetanurans’. 
The fibula lacks a proximomedial pocket and iliofibular tubercle. 
The tarsus (Fig. 1k-m) resembles basal saurischians’®. The astragalar 
ascending process is low and broad, unlike tetanurans, in which 
it is laminar, tall, and transversely wide’*'”. The proximal astragalar 
surface possesses a deep, well-defined basin posterior to the ascending 
process, as in basal dinosaurs. As in early saurischians (for example, 
Herrerasaurus, basal sauropodomorphs), the calcaneum of Chilesaurus 
is transversely wide and subtriangular in distal view, rather than rect- 
angular and disc-shaped as in theropods'’*’’”. The foot is wide and 
proximodistally short (Fig. 1h-j). The proximal half of metatarsal I is 
transversely compressed but anteroposteriorly expanded, unlike most 
theropods, in which it tapers proximally. Metatarsal I of Chilesaurus 
represents more than 50% of the length of metatarsal II, in contrast with 
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Figure 2 | Selected cranial bones and teeth of Chilesaurus diegosuarezi gen. 
et sp. nov. (SNGM-1935). a, Partial right (?) maxilla in lateral view. b, Left 
premaxilla in medial view. c, Right dentary in lateral view. d, Details of dentary 


most other theropods (25-33%)'"”. Metatarsal III is the longest, but 
metatarsal II is the thickest, as in basal sauropodomorphs. Digit I is 
large, only slightly shorter than digit II, approaching the tetradactyl 
condition of early sauropodomorphs, ornithischians'* and derived 
therizinosaurs’. 

The bizarre anatomy of Chilesaurus raises interesting questions 
about its phylogenetic relationships. We scored Chilesaurus into four 
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Figure 3 | Phylogenetic relationships of Chilesaurus diegosuarezi gen. et sp. 
nov. among main dinosaur clades and its plots in the theropod 
morphospaces. a, Time-calibrated simplified strict consensus tree, in which 
the green bars indicate herbivorous dinosaur lineages”. E, Early; L, Late; M, 
Middle. Aal, Aalenian; Alb, Albian; Apt, Aptian; Baj, Bajocian; Ber, Berriasian; 
Brm, Barremian; Bth, Bathonian; Clv, Callovian; Crn, Carnian; Cmp, 
Campanian; Cnm, Cenomanian; Con, Coniacian; Hau, Hauterivian; Het, 


© Neotetanurae 


teeth in lingual view. e, Crown of unerupted dentary tooth. f, Detail of the carina 
of an unerupted tooth (arrows indicate denticle positions). alv, alveoli; cr, 
crown tooth; ro, root tooth. 


different integrative archosauriform, theropod and sauropodomorph 
data sets”’?*". Remarkably, all these analyses placed Chilesaurus as 
a member of Theropoda, near the origin of tetanurans° (Fig. 3a), 
dismissing conceivable positions near Therizinosauria, Sauropodo- 
morpha or Ornithischia. The theropodan position of Chilesaurus is 
supported by pleurocoels in cervical and anterior dorsal vertebrae; 
hypapophyses on ‘pectoral’ vertebrae; preacetabular process of ilium 


Paraves 


Ornithomimosauria 
Therizinosauria 
Alvarezsauroidea 
Oviraptorosauria 


Maniraptora 
2 Coelurosauria 


5 
PCO1 


Hettangian; Kim, Kimmeridgian; Maa, Maastrichtian; Nor, Norian; Oxf, 
Oxfordian; Plb, Pliensbachian; Rh, Rhaetian; Sin, Sinemurian; Stn, Santonian; 
Tth, Tithonian; Toa, Toarcian; Tur, Turonian; Vlg, Valanginian. Numbers 
indicate millions of years ago. b-d, Chilesaurus diegosuarezi gen. et sp. nov. 
plotted in the theropod morphospace (principal coordinate (PCO)1 versus 
PCO2) based on axial skeleton (b); scapular girdle and forelimb (c); and pelvic 
girdle (d). Red dots in b-d indicate the position of Chilesaurus. 
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dorsoventrally expanded; femoral fourth trochanter semicircular; and 
tibia distally expanded and with lateral malleolus extending strongly 
laterally. Tetanuran affinities are supported by scapular blade elongate 
and strap-like; distal carpal semilunate; and manual digit III reduced 
(Supplementary Information). For a basal tetanuran, Chilesaurus pos- 
sesses a number of surprisingly plesiomorphic traits on the hindlimbs, 
especially in the ankle and foot, which resemble basal sauropodo- 
morphs””"”. These features are here considered as secondary reversals 
that might be related to a less-cursorial mode of locomotion. 
Furthermore, derived features of the dentary and teeth shared by 
Chilesaurus, sauropodomorphs and therizinosaurs are interpreted as 
homoplasies related to herbivorous habits**°. In this context, pubic 
retroversion of Chilesaurus may be related to an increased gut capacity 
for processing plant material”. 

The discovery of Chilesaurus lends support to the interpretation 
that dietary diversification towards herbivory was more common- 
place among basal theropods than previously thought. Independent 
evolution of herbivory has been recognized for several major coelur- 
osaurian subclades”, but for just a single probable example outside 
Coelurosauria (that is, the toothless ceratosaurian Limusaurus*®’’). 
Chilesaurus expands the list of non-coelurosaurian theropods that 
shifted their diet from carnivore to herbivore. 

Chilesaurus represents an extreme case of mosaic evolution among 
dinosaurs, owing to the presence of dental, cranial and postcranial fea- 
tures that are homoplastic with multiple disparate groups. Using quant- 
itative morphospace analysis, we explored morphospace occupation of 
different skeletal regions in Chilesaurus with respect to a variety of avian 
and non-avian theropods. This shows that Chilesaurus has a ceratosaur- 
like axial skeleton, a ‘basal tetanuran’ forelimb and scapular girdle, a 
coelurosaur-like pelvis, and a tetanuran-like hindlimb (Fig. 3b-d and 
Extended Data Fig. 5). General ankle and foot construction does not 
group with any theropod clade, probably as a result of the characters 
shared by Chilesaurus, sauropodomorphs and herrerasaurids. 

Chilesaurus is the numerically dominant taxon in the Aysén tet- 
rapod fossil assemblage, and represents an unusual case of a theropod 
having the palaeoecological role of a preeminent small-to-medium 
sized herbivore in a Jurassic ecosystem. This is in sharp contrast with 
other Late Jurassic dinosaur assemblages (for example, Tendaguru and 
Morrison formations'!”’), in which ornithischian dinosaurs are the 
most abundant small-to-medium sized herbivores. Available evidence 
indicates that Chilesaurus is a unique dinosaur lineage known only 
from southern South America, suggesting an outstanding case of 
endemism among otherwise relatively cosmopolitan worldwide 
Jurassic dinosaur faunas”*. 

Chilesaurus illustrates how much relevant data on the early diver- 
sification of major dinosaur clades remain unknown. It also provides 
an important cautionary benchmark in our attempts to gain a reliable 
view of the overall evolutionary history of Dinosauria. 
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Extended Data Figure 1 | Holotype of Chilesaurus diegosuarezi gen. et sp. _ cervical vertebrae; Dv, dorsal vertebrae; Lf, left forelimb; Lfem, left femur; Lsc, 
nov. (SNGM-1935) as it was found in the field. Cd, caudal vertebrae; Cv, left scapulocoracoid; Ltib, left tibia; Rf, right forelimb; Sk, skull. 
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Extended Data Figure 2 | Selected skull bones of Chilesaurus diegosuarezi _d, right dentary in medial view. al, alveolus; ao, antorbital opening; or, orbital 
gen. et sp. nov. (SNGM-1935; holotype). a, Right frontal in dorsal view; rim; t, teeth; tf, lower temporal fossa rim. 
b, right postorbital in lateral view; c, incomplete right (?) maxilla in lateral view; 
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Extended Data Figure 3 | Scapular girdle and selected forelimb bones of 1938) in anterior view; d, left radius (SNGM-1935, holotype) in lateral view; 
Chilesaurus diegosuarezi gen. et sp. nov. a, b, Left scapula and coracoid and e, left ulna (SNGM-1935, holotype) in lateral view. cf, coracoid foramen; 
(SNGM-1938) in lateral (a) and posterior (b) views. c, Left humerus (SNGM- __ dp, deltopectoral crest; gl, glenoid cavity; ol, olecranon process. 
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Extended Data Figure 4 | Selected hindlimb elements of Chilesaurus posterior (c), medial (d) and proximal (e) views. at, anterior trochanter; cn, 
diegosuarezi gen. et sp. nov. a, b, Right femur (SNGM-1935) in anterior cnemial crest; f, fibula; fh, femoral head, 4t, fourth trochanter; gt, greater 
(a) and lateral (b) views. c-e, Articulated right tibia and fibula (SNGM-1935) in trochanter, ic, inner condyle; oc, outer condyle; t, tibia; tfc, tibiofibular crest. 
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Extended Data Figure 5 | Bivariate plots showing the results of the 
morphospace occupation analysis of Chilesaurus diegosuarezi gen. et sp. 
nov. based on characters of different regions of the skeleton, taken from the 
modified data matrix and the first and second axes of the principal 
coordinate analysis. a—d, Bivariate plots using all the characters (a), cranial 
characters (b), hindlimb zeugopodium and stylopodium characters (c), and 
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tarsal and pedal characters (d). The convex hulls represent different 
dinosauriform groups rather than statistically distinct clusters. Light grey 
polygon, non-neotheropod dinosauriforms; blue polygon, non-averostran 
neotheropods; green polygon, ceratosaurs; pink polygon, megalosauroids; dark 
grey polygon, allosauroids; light blue polygon, coelurosaurs; red dot, 
Chilesaurus. 
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Extended Data Table 1| Selected postcranial measurements of 
three specimens of Chilesaurus diegosuarezi gen. et sp. nov. 


Scapula « 
Radius 

Humerus 

Metacarpal III 

lliac blade 


Pubis 
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Tibia 
Metatarsal | 
Metatarsal II 
Metatarsal III 


Measurements are in mm. All the values represent maximum measurable lengths. Maximum deviation 
of the digital calliper equals 0.02 mm but measurements were rounded to the nearest 0.1 mm. 
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Activating positive memory engrams suppresses 


depression-like behaviour 


Steve Ramirez, Xu Liut, Christopher J. MacDonald!, Anthony Moffa', Joanne Zhou", Roger L. Redondo? & Susumu Tonegawal* 


Stress is considered a potent environmental risk factor for many 
behavioural abnormalities, including anxiety and mood disor- 
ders’*. Animal models can exhibit limited but quantifiable beha- 
vioural impairments resulting from chronic stress, including 
deficits in motivation, abnormal responses to behavioural 
challenges, and anhedonia**. The hippocampus is thought to 
negatively regulate the stress response and to mediate various cog- 
nitive and mnemonic aspects of stress-induced impairments**”, 
although the neuronal underpinnings sufficient to support beha- 
vioural improvements are largely unknown. Here we acutely rescue 
stress-induced depression-related behaviours in mice by optogen- 
etically reactivating dentate gyrus cells that were previously active 
during a positive experience. A brain-wide histological investiga- 
tion, coupled with pharmacological and projection-specific opto- 
genetic blockade experiments, identified glutamatergic activity in 
the hippocampus-amygdala-nucleus-accumbens pathway as a 
candidate circuit supporting the acute rescue. Finally, chronically 
reactivating hippocampal cells associated with a positive memory 
resulted in the rescue of stress-induced behavioural impairments 
and neurogenesis at time points beyond the light stimulation. 
Together, our data suggest that activating positive memories arti- 
ficially is sufficient to suppress depression-like behaviours and 
point to dentate gyrus engram cells as potential therapeutic nodes 
for intervening with maladaptive behavioural states. 

Our recent studies have demonstrated that dentate gyrus cells that 
express c-Fos during fear or reward conditioning define an active 
neural population that is sufficient to elicit both aversive and appetitive 
responses, and that the mnemonic output elicited by these artificially 
reactivated cells can be updated with new information®*. These find- 
ings raise the possibility of alleviating stress-induced behavioural 
impairments via a defined set of dentate gyrus cells that are active 
during a positive experience. Indeed, how positive episodes interact 
with psychiatric-disease-related behavioural states, including depres- 
sion-related impairments, at the neuronal and systems level remains 
largely unknown, despite the promising cognitive treatments available 
in humans’. 

To address this issue, we used our recently developed method that 
enables labelling and manipulation of memory engram cells (see 
Methods)**. Exposing animals that were taken off doxycycline to a 
naturally rewarding experience’ (that is, exposure to a female mouse in 
a modified home cage, hereafter referred to as a ‘positive experience’ 
and further validated in Extended Data Fig. 1), a neutral context (here- 
after referred to as a ‘neutral experience’), or a single bout of immob- 
ilization stress (hereafter referred to as a ‘negative experience’) all 
elicited comparable levels of ChR2-mCherry expression in the dentate 
gyrus (Extended Data Fig. 2a-e). 

As shown in Fig. 1a, mice were split into six groups (see Methods). 
After 10 days of chronic immobilization stress (CIS) (Extended Data 
Fig. 2f) or in a home cage, all groups were put through the open field 
test (OFT) and elevated plus maze test (EPMT) as measures of anxiety- 
like behaviours, as well as the tail suspension test (TST) as a measure of 


active/passive escape behaviour in response to a challenging situation, 
and the sucrose preference test (SPT) for anhedonia’*“. In unstressed 
animals, optogenetic reactivation of cells previously active during a 
positive experience did not significantly change anxiety-related mea- 
sures, time spent struggling, or preference for sucrose compared to 
unstressed mCherry controls (Fig. 1b-e). In the stressed groups, the 
CIS paradigm elicited a robust decrease in time struggling and pref- 
erence for sucrose, as well as increased anxiogenic responses, consist- 
ent with previous reports’** (Fig. 1b-e). 

However, optically reactivating dentate gyrus cells that were prev- 
iously active during a positive experience, but not a neutral or a nega- 
tive experience, in stressed animals acutely increased time struggling 
and sucrose preference to levels that matched the unstressed group’s 
behaviour (Fig. 1b, c). Additionally, optical reactivation of dentate 
gytus cells associated with a positive experience decreased the latency 
to feed in a novelty-suppressed feeding test (NSFT)'* (Extended Data 
Fig. 3a) without affecting hunger or satiety (Extended Data Fig. 3b). 
Once again, the CIS paradigm had an anxiogenic effect across all 
groups, and all groups failed to show light-induced behavioural 
changes in the OFT or EPMT (Fig. 1d, e). Similarly, total distance 
travelled was consistent across groups (Extended Data Fig. 4c). 
Taken together, these data argue that reactivating dentate gyrus cells 
labelled by a positive experience is sufficient to acutely reverse the 
behavioural effects of stress in the TST, SPT and NSFT. 

To identify potential neural loci that mediate the light-induced 
reversal of the stress-induced behaviours observed in our experiments, 
all subjects first underwent the CIS protocol and then were exposed to 
the TST while dentate gyrus cells previously active during a positive 
experience were optically reactivated. We then performed a brain-wide 
mapping of c-Fos expression in areas activated by this treatment 
(Fig. 2a). 

Optical reactivation of dentate gyrus cells labelled by a positive 
experience correlated with a robust increase of c-Fos expression in 
several brain areas, including the nucleus accumbens (NAcc) shell, 
lateral septum, basolateral amygdala (BLA), central amygdala, as well 
as the dorsomedial, ventromedial, and lateral hypothalamus (Fig. 2b-i 
and Extended Data Fig. 5a, b), but not in the medial prefrontal cortex 
(mPFC) (Fig. 2j-m) or in several other loci (Extended Data Fig. 5c-e). 
Furthermore, we monitored single-unit activity in the BLA of mice 
while simultaneously activating dentate gyrus positive memory- 
engram cells with blue light and found that ~8% of cells (9/106; 
n = 3 mice) had excitatory (8/9 cells) or inhibitory (1/9 cells) responses 
(Extended Data Fig. 4a). A parallel set of experiments in which 
unstressed animals received optical stimulation of dentate gyrus cells 
revealed mostly similar patterns of c-Fos expression (Extended Data 
Fig. 6). 

The NAcc has been heavily implicated in stress responses, mood 
disorders, and processing natural rewards**"°"'*">*°, Moreover, patho- 
physiological dysfunction of the NAcc in response to various stressors 
has been implicated in anhedonia and reward conditioning’””°. Our 
within-subject experiments revealed that, in the TST, the behavioural 
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Figure 1 | Activating positive memory engrams rescues depression-related 


behaviour. a, Behaviour schedule and groups used. Dox, doxycycline. 
Female symbols represent exposure to a female conspecific, white hexagons 
represent neutral contexts, and mice in the ‘stress’ condition are depicted 
undergoing an immobilization protocol. b-e, Optical reactivation of dentate 
gyrus cells that were previously active during a positive experience significantly 
increases time struggling in the tail suspension test (b) and preference for 
sucrose (c), but does not have a significant effect in anxiety-like behaviour in the 
open field test (d) or elevated plus maze test (e). A two-way analysis of variance 
(ANOVA) with repeated measures revealed a group-by-light epoch interaction 
in the TST (Fs,294 = 21.20, P < 0.001) or SPT (Fs,196 = 6.20, P < 0.001) 
followed by Bonferroni post hoc tests, which revealed significant increases in 
struggling or preference for sucrose in the positive memory plus stress group. 
#P < 0.01. # used to denote significant differences between the four stressed 
groups (n = 18 per group) versus the two non-stressed groups (n = 16 per 
group); *P < 0.05, **P < 0.01 (orange asterisks used to denote significant 
differences between the stress plus positive memory group versus the other 
three stressed groups). Data are means + s.e.m. 


effects of optically reactivating dentate gyrus cells labelled by a positive 
experience were blocked in the group of mice that concurrently received 
the glutamate receptor antagonists NBQX and AP5 in the NAcc, but not 
in the group that received saline, without altering basal locomotion 
(Extended Data Fig. 4b, c). Blocking dopaminergic activity yielded a 
similar blockade of the dentate gyrus light-induced effects (Extended 
Data Fig. 7a). 

The BLA is known to have robust glutamatergic inputs to the 
NAcc’, and previous studies have implicated BLA projections to the 
NAcc in enabling reward-seeking behaviour'’. We therefore investi- 
gated whether the hippocampus (dentate gyrus)-BLA-NAcc func- 
tional pathway is crucial for the real-time light-induced rescue of 
depression-related behaviour. Our transgenic mice were bilaterally 
injected with TRE-ArchT-eGFP into the BLA to allow for activity- 
dependent ArchT-eGFP labelling of axonal terminals from the BLA to 
the NAcc in response to a positive experience”’ (Fig. 3a, b). Optic fibres 
were bilaterally placed over the NAcc and the dentate gyrus to allow for 
real-time inhibition of these terminals originating from ~ 18% (Fig. 3c) 


336 | NATURE | VOL 522 |18 JUNE 2015 


ma Positive memory + stress (ChR2) 
© Neutral memory + stress (ChR2) 
mm Positive memory + stress (mCherry) 


2 
2: 
& 
x2) 
zo) 
oO 
+ 
on 
fo) 
oe 
Positive Neutral J} mCherry. = 
* BD 250 
Positive Neutral’ J mCher o 
ry = 200 ek * 
BLA ‘» 150 
oO T 
© 100 x 
7) 
8 50 
ns 
6 0 
Positive } Neutral /’mCherry BLA CeA 
mPFC. 
2 NS 
2 
i= 
‘a == 
oO 
oO 
+ 
no 
(eo) 
Be 
Positive Neutral || mCherry - mPFC 


Figure 2 | Positive memory reactivation increases c-Fos expression in the 
nucleus accumbens shell and the amygdala. a, Brain diagram illustrating 
target areas analysed. b-d, Activation of a positive memory, but not a neutral 
memory or mCherry only, in the dentate gyrus during the TST elicits robust 
c-Fos expression in the nucleus accumbens shell (b), basolateral amygdala, and 
central amygdala (c), but not in the medial prefrontal cortex (d). For 
histological data, a one-way ANOVA followed by a Bonferroni post hoc test 
revealed a significant increase of c-Fos expression in the positive memory plus 
stress group relative to controls in the NAcc and amygdala, but not the mPFC 
(NAcc shell, Fs,39 = 15.2, P < 0.01; BLA, Fo,39 = 11.71, P < 0.01; central 
amygdala, Fy 39 = 11.45, P < 0.05; mPFC, Fy 39 = 1.33, P = 0.294.n = 6 
animals per group, 3-5 slices per animal). NS, not significant; *P < 0.05, 
**P < 0.01. Data are means + s.e.m. Scale bars correspond to 100 tm. HPC, 
hippocampus; LH, lateral habenula; LS, lateral septum; Hyp., hypothalamus. 


of BLA neurons and simultaneous activation of ChR2-mCherry- 
positive dentate gyrus cells, respectively, in stressed mice. At the neur- 
onal level, light-induced reactivation of dentate gyrus cells previously 
activated by a positive experience also reactivated BLA*® and NAcc’*, 
but not mPFC, cells (that is, endogenous c-Fos” cells, red) previously 
activated by the same positive experience (that is, ArchT-eGFP* cells, 
green) (Fig. 3c). These results suggest that the dentate gyrus engram 
cells are functionally connected to BLA engram cells and NAcc engram 
cells. At the behavioural level, inhibition of BLA terminals onto the 
NAcc blocked the dentate gyrus light-induced rescue in both the TST 
and SPT (Fig. 3d-g). Within the same behavioural session for the TST, 
and across 2 days for the SPT, when ArchT-mediated inhibition was 
released (that is, the green light was turned off), the rescue effects of 
reactivating dentate gyrus cells previously active during a positive 
experience were rapidly observed in all groups (Fig. 3d-g). ArchT- 
mediated inhibition of BLA-NAcc terminals alone did not negatively 
affect behaviour in the TST or SPT beyond the levels of the stressed 
animals (Fig. 3d—g insets). The specificity of the hippocampus (dentate 
gyrus)-BLA-NAcc pathway for the rescue was supported by an ana- 
logous experiment conducted with bilateral injections of TRE-ArchT- 
eGFP into the mPFC. Although the mPFC is also known to provide 
robust glutamatergic input to the NAcc”, the induction of c-Fos 
expression in this area upon optogenetic activation of dentate gyrus 
cells associated with a positive experience was not significantly higher 
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than that observed with a neutral experience (Fig. 2m), and mPFC cells 
reactivated by dentate gyrus cell reactivation was at chance level 
(Fig. 3c). Correspondingly, inhibition of terminals originating from 
~12% of the mPFC (Fig. 3c) onto the NAcc did not block the dentate 
gyrus light-induced rescue in either the TST or SPT (Fig. 3d-g). 
Moreover, inhibition of BLA, but not mPFC, terminals onto the 
NAcc partially inhibited the dentate-gyrus-mediated, light-induced 
increase of c-Fos” cells observed in the NAcc shell (Fig. 3h), supporting 
the conclusion that the hippocampal (dentate gyrus)-BLA-—NAcc path- 
way of positive engrams plays a crucial role in the rescue of depression- 
related behavioural phenotypes. 
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Recent meta-analyses have suggested that treating psychiatric 
disorders through prescribed medication or cognitive interventions 
are capable of producing symptom remission when administered 
chronically”®, though the neural underpinnings inducing and correl- 
ating with long-lasting rescues are poorly understood”. The afore- 
mentioned acute intervention did not induce enduring behavioural 
changes (Extended Data Fig. 7b). We therefore investigated whether 
chronic reactivation of dentate gyrus engram cells could attenuate 
depression-related behaviours in a manner that outlasted acute optical 
stimulation following the protocol depicted in Fig. 4a (Methods). A 
group in which dentate gyrus cells associated with a positive experi- 
ence were optically reactivated across 5 days, but not 1-day or no 
stimulation groups, showed a reversal of the stress-induced beha- 
vioural deficits measured in the TST and SPT that was not significantly 
different from an unstressed control group (Fig. 4b, c). A group 
in which dentate gyrus cells associated with a neutral experience 
were optically reactivated across 5 days did not show such effects, 
nor did a group that was exposed to a natural social reward for 5 days 
(Fig. 4b-d). Histological analyses revealed decreased levels of neuro- 
genesis as measured both by the polysialylated neuronal cell adhesion 
molecule (PSA-NCAM) and doublecortin (DCX)—often considered 
markers of developing and migrating neurons**—in all stressed 
groups except for the positive experience and 5-day stimulation group, 
and the unstressed control group (Fig. 4d and Extended Data Fig. 8). 
This increase in adult-born neurons positively correlated with the 
degree to which each group preferred sucrose in the SPT (Extended 
Data Fig. 9a); moreover, performance levels on the SPT and TST 
positively correlated with one another on an animal-by-animal basis 
(Extended Data Fig. 9b). 

Our data demonstrate that the depression-related readouts of 
active/passive coping-like behaviour and anhedonia, as measured in 
the TST and SPT, respectively, can be ameliorated by activating cells in 
the hippocampus associated with a positive memory, while anxiety- 
related behaviours measured by the OFT and EPMT remained 
unchanged. Differential regulation of depression- and anxiety-related 
behaviour could have been achieved by leveraging the functional 


Figure 3 | The antidepressant effects of an optically activated positive 
memory require real-time terminal activity from the BLA to the NAcc. 

a, Brain diagram illustrating target areas manipulated. b, Representative 
coronal slices showing TRE-ArchT-eGFP-positive cells in the BLA or mPFC, 
as well as their corresponding terminals in the NAcc. Scale bars: BLA and 
mPFC, 500 ttm; NAcc, 200 jum. ¢, Animals were taken off Dox and initially 
exposed to a positive experience, which caused labelling of corresponding 
BLA (~18%), mPFC (~12%), or NAcc (~9%) cells with eGFP derived from 
AAV.-TRE-ArchT-eGFP (green, halo-like expression). Light-activation of a 
positive memory engram in the dentate gyrus (DG) preferentially reactivated 
the BLA and NAcc shell cells, as measured by endogenous c-Fos expression 
(red, nucleus-localized), that were originally labelled by the same positive 
experience, while groups with no light stimulation showed levels of overlap not 
significantly different from chance. Arrowheads indicate double-stained cells. 
Scale bar, 5 um. d-g, ArchT-mediated inhibition of BLA, but not mPFC, 
terminals in the NAcc prevents the dentate-gyrus-mediated light-induced 
increases in struggling (d, e) or preference for sucrose (f, g), while inhibition of 
BLA terminals in the NAcc without dentate gyrus stimulation does not affect 
behaviour (insets). h, ArchT-mediated inhibition of BLA, but not mPFC, 
terminals prevents the dentate-gyrus-mediated light-induced increase of c-Fos 
expression in the NAcc. For behavioural data, a two-way ANOVA with 
repeated measures followed by a Bonferroni post hoc test revealed a group-by- 
light epoch interaction and significant ArchT-mediated attenuation of 
struggling in the TST (d: Fy,.99 = 7.30, P < 0.001; e: Fy99 = 6.61, P< 0.01) or 
preference for sucrose water in the the SPT (ft Fre. = 10.66, P< 0.01).n = 12 
per behavioural group. *P < 0.05, **P < 0.01, ***P < 0.001; orange asterisks 
used to denote significant differences between the stress plus positive memory 
group versus all other groups. For histological data, one-sample f-tests against 
chance overlap were performed (n = 4 per group, 3-5 slices per animal). NS, 
not significant. HPC, hippocampus; LH, lateral habenula; LS, lateral septum; 
Hyp., hypothalamus. Data are means ~ s.e.m. 
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Figure 4 | Chronic activation of a positive memory elicits a long-lasting 
rescue of depression-related behaviour. a, Behavioural schedule and groups 
used. NoStim, no stimulation. Female symbols represent exposure to a female 
conspecific, white hexagons represent neutral contexts, and mice in the ‘stress’ 
condition are depicted undergoing an immobilization protocol. b, c, Animals in 
which a positive memory was reactivated twice a day for 5 days showed 
increased struggling in a 6-min tail suspension test (F573 = 3.34, P < 0.05) 
(b) and increased preference for sucrose measured over 24 h (Fs,g4 = 6.25, 
P< 0.01) (c). d, The 5-day positive memory stimulation group showed a 
significant increase of adult newborn cells in the dentate gyrus as measured by 
PSA-NCAM‘ cells (Fs,72 = 4.65, P < 0.01; see Extended Data Fig. 8 for 
doublecortin data and PSA-NCAM images). For these data (b-d), a one-way 
ANOVA revealed a significant interaction of the experimental-group factor 
and stimulation-condition factor and was followed by a Bonferroni post 

hoc test. n = 14 per TST behavioural group, n = 15 per SPT behavioural 
group, = 5 slices per animal for data appearing in d. *P < 0.05. Data are 
means + s.e.m. 


segregation present along the hippocampus dorsal-ventral axis; for 
instance, activation of ventral hippocampal dentate gyrus engram cells 
could reveal heterogeneous, behaviourally relevant roles in the emo- 
tional regulation of anxiety and stress responses that our dorsal hip- 
pocampus manipulations presumably did not access**”’”. To that end, 
we speculate that, at the engram level, the circuitry sufficient to modu- 
late anxiety-related behaviour relies more heavily on a synaptic dia- 
logue within the amygdala, its bidirectional connections with the 
ventral hippocampus, and its effects on downstream mesolimbic and 
cortical structures'®"!?°7, 

Depression is diagnosed as a constellation of heterogeneous 
symptoms; their complex aetiology and pathophysiology underscore 
the varied responses to currently available treatments. While most 
psychopharmacological treatments take weeks to achieve effects, other 
alternative treatments such as deep-brain stimulation and the NUDA 
antagonist ketamine have been reported to have rapid effects in a 
subset of patients”*. In rodents, optogenetic stimulation of mPFC neu- 
rons, mPFC to raphe projections, and ventral tegmental dopaminergic 
neurons achieved a rapid reversal of stress-induced maladaptive 
behaviours*’*”*. We speculate that our acute behavioural changes 
reflect the degree to which directly stimulating positive-memory- 
engram-bearing cells might bypass the plasticity that normally takes 
antidepressants weeks or months to achieve, thereby temporarily 
suppressing the depression-like state. In support, we observed that 
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the effects of optically stimulating a positive memory are contingent 
on active glutamatergic projections from the amygdala to the NAcc in 
real time, as well as intra- NAcc dopamine activity™®. Our data dovetail 
with this circuit’s proposed role of relaying BLA stimulus-reward 
associations to a ventral striatal motor-limbic interface. This interface 
is thought to be capable of coalescing such information with motiva- 
tional states and finally translating such activity into behaviourally 
relevant outputs”’7"’. 

Moreover, our chronic stimulation data reveal that repeatedly activ- 
ating dentate gyrus engram cells associated with a positive experience 
elicits an enduring reversal of stress-induced behavioural abnormal- 
ities and an increase in neurogenesis. While future experiments are 
required to identify the causal link between chronically reactivated 
positive memory engrams and the corresponding rescue of behaviours, 
many tantalizing hypotheses surface, including a normalization of 
VTA firing rates”, epigenetic and differential modification of effector 
proteins (for example, CREB, BDNF) in areas upstream and down- 
stream of the hippocampus”, and a reversal of neural atrophy in areas 
such as CA3 and mPFC or hypertrophy in BLA”*. The aforementioned 
molecular and homeostatic mechanisms—in addition to our observed 
increase of adult-born neurons in the 5-day stimulation group—could 
be partly realized in a hormone- or neuromodulator-mediated manner 
(Extended Data Fig. 5). Finally, our data demonstrate that exposing 
stressed subjects to a natural positive experience repeatedly is not 
effective, while repeated direct reactivations of dentate gyrus engram 
cells associated with a previously acquired positive memory is effective 
(Fig. 4b-d). We speculate that invasively stimulating these dentate 
gytus cells is effective in activating both the internal contextual rep- 
resentation associated with a positive experience as well as associated 
downstream areas, while exposure to natural exogenous positive cues 
may not be able to access similar neural pathways in subjects display- 
ing depression-like symptoms such as passive behaviour in challen- 
ging situations and anhedonia (Fig. 4b-d). 

Collectively, the data described here build a novel experimental 
bridge between memory engrams in the brain and animal models of 
psychiatric disorders. We propose that direct activation of dentate 
gyrus engram cells associated with a positive memory offers a potential 
therapeutic node for alleviating a subset of depression-related beha- 
viours and, more generally, that directly activating endogenous neur- 
onal processes may be an effective means to correct maladaptive 
behaviours. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Subjects. The c-fos-tTA mice were generated by crossing TetTag*' mice with 
C57BL/6J] mice and selecting for those carrying the c-fos-tTA transgene. 
Littermates were housed together before surgery and received food and water 
ad libitum. All mice were raised on a diet containing 40 mg kg’ doxycycline 
(Dox) for a minimum of 1 week before receiving surgery at age 12-16 weeks. Post- 
operation, mice were individually housed in a quiet home cage with a reverse 12 h 
light-dark cycle, given food and water ad libitum, and allowed to recover for a 
minimum of 2-3 weeks before experimentation. All animals were taken off Dox 
for an undisturbed 42 h to open a time window of activity-dependent labelling. In 
our system, the promoter of c-Fos—an immediately early gene often used as a 
marker of recent neural activity—is engineered to drive the expression of the 
tetracycline transactivator (tTA), which in its protein form binds to the tetracy- 
cline response element (TRE). Subsequently, the activated TRE drives the light- 
responsive channelrhodopsin-2 (ChR2). Importantly, the expression of ChR2 only 
occurs in the absence of doxycycline (Dox) from the animal’s diet, thus permitting 
inducible expression of ChR2 in correspondingly active cells. 

Each group of male mice was exposed to all three subsequent treatments for 2 
hours and randomly assigned which experience would occur while off Dox; a 
negative experience (that is, a single bout of immobilization stress, see below), 
a naturally rewarding experience (that is, exposure to a female conspecific while in 
a modified home cage, as previously reported*’), and a neutral experience (that is, 
exposure to a conditioning chamber). For female exposure, single-caged male mice 
were moved to a behaviour room distinct from the housing room and with dim 
lighting conditions. Next, the cage tops were removed and a 4-sided (31 X 25 X 30 
cm) white box was placed over the home cage, after which a female mouse was 
introduced to the home cage. Importantly, this modification to the home cage 
during female exposure ensured similar levels of dentate gyrus labelling as the 
neutral and negative memory exposure groups (Extended Data Fig. 2). Each group 
was taken off Dox only during one of the aforementioned treatments and placed 
back on Dox immediately afterwards. The subjects were age-matched and split 
into two groups: a stressed group and a non-stressed group. Non-stressed animals 
remained in their home cages before experimentation. Stressed animals under- 
went 2-3 h of chronic immobilization stress (CIS) each day for ten consecutive 
days before behavioural testing using Mouse DecapiCone disposable restrainers. 
All procedures relating to mouse care and treatment conformed to the institutional 
and National Institutes of Health guidelines for the Care and Use of Laboratory 
Animals. Sample sizes were chosen on the basis of previous studies’; variance 
was similar between groups for all metrics measured. No statistical methods were 
used to predetermine sample size. 

Virus constructs and packaging. The pAAVo-TRE-ChR2-mCherry and pAAVo- 
TRE-mCherry plasmids were constructed as previously reported*’. The pAA 
Vo-TRE-ArchT-eGFP was constructed by replacing the ChR2-eYFP fusion gene 
in the pAAVo-TRE-ChR2-eYFP plasmid from Liu et al.** with a fusion gene of 
ArchT-eGFP from Han et al.**. These plasmids were used to generate AA Vo viruses 
by the Gene Therapy Center and Vector Core at the University of Massachusetts 
Medical School. Viral titrations were 8 X 10'? genome copy per ml for AAVy- 
TRE-ChR2-mCherry, 1.4 X 10'* genome copy per ml for AAVo-TRE-mCherry, 
and 0.75 to 1.5 X 10'* genome copy per ml for AAVo-TRE-ArchT-eGEP. 
Stereotactic injection, cannulation, and fibre optic implants. All surgeries were 
performed under stereotaxic guidance and subsequent coordinates are given rela- 
tive to bregma. Animals were anaesthetized using 500 mg kg‘ Avertin before 
receiving bilateral craniotomies using a 0.5 mm diameter drill bit at -2.2 mm 
anterioposterior (AP), +1.3 mm mediolateral (ML) for dentate gyrus injections. 
All mice were injected with 0.15 pl of AAV9 virus at a controlled rate of 0.6 pl 
min”! using a mineral oil-filled glass micropipette joined by a microelectrode 
holder (MPH6S; WPI) to a 10 pl Hamilton microsyringe (701LT; Hamilton) in 
a microsyringe pump (UMP3; WPI). The needle was slowly lowered to the target 
site at -2.0 mm dorsoventral (DV). The micropipette remained at the target site for 
another 5 minutes post-injection before being slowly withdrawn. A bilateral 
optical fibre implant (200 um core diameter; Doric Lenses) was lowered above 
the injection site (-1.6 mm DV for dentate gyrus) and three jewellery screws were 
secured into the skull at the anterior and posterior edges of the surgical site to 
anchor the implant. For mice used in pharmacological manipulations, bilateral 
guide cannula (PlasticsOne) were implanted above the NAcc (+ 1.2 mm AP; 0.5 
mm ML; -3.25 mm DV). Mice used in the BLA-to-NAcc or mPFC-to-NAcc 
experiments received bilateral injections (0.2 pl to 0.3 pl) of TRE-ArchT-eGFP 
or TRE-eGFP into the BLA (-1.46 mm AP; +3.20 mm ML; -4.80 mm DV), NAcc 
(+1.2 mm AP; +£0.50 mm ML; —4.3 mm DV), or the mPFC (+1.70 mm AP; 
+£0.35 mm ML; -2.70 mm DV). These mice were then injected with TRE-ChR2- 
mCherry into the dentate gyrus and received bilateral optic fibre implantation as 
described above (Doric Lenses), as well as bilateral optic fibre implantation over 
the NAcc (+1.2 mm AP; £0.50 mm ML; -3.70 mm DV). 


Layers of adhesive cement (C&B Metabond) followed by dental cement (Teets 
cold cure; A-M Systems) were spread over the surgical site and protective cap to 
secure the optical fibre implant. The protective cap was made from the top portion 
of a black polypropylene microcentrifuge tube. Mice received intraperitoneal 
injections of 1.5 mg kg~' analgesics and were placed on heating pads throughout 
the procedure until recovery from anaesthesia. Histological studies were used to 
verify fibre placements and viral injection sites. Only data from mice with opsin or 
fluorophore expression restricted to the dentate gyrus, BLA or mPFC were used for 
histological, behavioural and statistical analyses. 

Pharmacological infusion of glutamate or dopamine receptor antagonists. 
Glutamate antagonists were bilaterally infused into the NAcc as follows: 0.2 ul 
per hemisphere of NBQX at a concentration of 22.3 mM to antagonize AMPA 
(o-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid) receptors and 0.2 pl 
per hemisphere of AP5 at a concentration of 38.04 mM to antagonize NMDA 
(N-methyl-p-aspartate) receptors. Dopamine receptor antagonists were bilaterally 
infused into the NAcc as follows: 0.2 1] SCH23390 at a concentration of 6.16 mM 
to antagonize D1-like receptors and 0.2 ul raclopride at a concentration of 2.89 
mM to antagonize D2-like receptors. A 26-gauge stainless steel double internal 
cannula (PlasticsOne) was used to bilaterally infuse each drug; the internal cannula 
was connected with a microsyringe pump by a PE20 tube to control the injection 
rate at 100 nl min’. The injection cannula was left connected for 5 min before 
removal to allow for diffusion. Finally, all behaviour was performed 20 min fol- 
lowing drug infusion. 

Immunohistochemistry. Mice were overdosed with 750-1000 mg kg’ Avertin 
and perfused transcardially with cold PBS, followed by 4% paraformaldehyde 
(PFA) in PBS. Extracted brains were kept in 4% PFA at 4°C overnight, then 
transferred to PBS. A vibratome was used to recover 50-l1m coronal slices in cold 
PBS. Slices were washed with PBS-T (PBS + 0.2% Triton X-100), then incubated 
with PBS-T + 5% normal goat serum at 4°C for 1 h for blocking. For immuno- 
staining, slices were incubated with one or more primary antibodies (1:1000 
dilution) at 4°C for 24 h (600-401-379 Rockland; A10262, Invitrogen; SC-52, 
Santa Cruz). Three washes of PBS-T for 10 min each were performed on the slices 
before 1 h incubation with secondary antibody at 1:200 dilution (A11039, 
Invitrogen; A21429, Invitrogen). Slices were washed three more times in PBS-T 
for 10 min each, stained with 4’,6-diamidino-2-phenylindole (DAPI; 1:10,000 
dilution) to label cell nuclei and mounted with Vectashield H-1200 onto micro- 
scope slides. 

Behavioural assays. All behaviour assays were conducted during the light cycle of 
the day (7:00-19:00) on animals 12-16 weeks old. Mice were handled for 3-5 days, 
2 min per day, before all behavioural experiments. 

Tail suspension test. Fibre optic implants on experimental mice were plugged 
into a patch cord before the tail suspension test. Each subject was hung by its tail 
from a bar 40 cm from the ground with a single piece of autoclave tape. The animal 
was positioned such that it had no contact with other objects. Immediately after 
positioning, video recordings of the animal’s movements were taken (Noldus by 
Ethovision). Blue light stimulation was given at 20 Hz, 15 ms pulse width, ~ 15-20 
mW. For behavioural data appearing in Fig. 1, all mice were exposed to a 9 min tail 
suspension test with light stimulation occurring at minutes 3-5, inclusive; for 
histological data appearing in Fig. 2, all mice were exposed to a 6 min tail suspen- 
sion test with light stimulation occurring throughout the entire session using the 
same stimulation parameters described above. For data appearing in Fig. 3, all 
animals were given a 9 min tail suspension test once a day for 2 days to assess the 
effects of ArchT inhibition on BLA or mPFC terminals in the NAcc while simul- 
taneously activating ChR2-positive cells in the dentate gyrus. For half of the sub- 
jects, on day 1, ArchT-mediated inhibition occurred during minutes 3-5, 
inclusive, using constant green light at ~25 mW; dentate gyrus stimulation 
occurred from minutes 3-8, inclusive. For the other half, ArchT-mediated inhibi- 
tion occurred during minutes 6-8, inclusive; and dentate gyrus stimulation 
occurred from minutes 3-8, inclusive. The treatments occurring on days 1 and 
2 were counterbalanced within and across groups. A separate cohort of animals 
were used for the data appearing in the insets of Fig. 3d-g. These groups contained 
TRE-ChR2-mCherry in the dentate gyrus, as well as bilateral optic fibres over the 
dentate gyrus, and TRE-ArchT-eGFP in the BLA, as well as optic fibres over the 
NAcc to inhibit BLA terminals during the appropriate light-on epochs in the TST 
and SPT. These cohorts, too, were counterbalanced across sessions and only 
received green light over the NAcc for 3 min during the TST or 15 min during 
the SPT. For the c-Fos counts appearing in Fig. 3h, all groups underwent a 6 min 
tail suspension test with blue light delivered to the dentate gyrus and green light 
delivered to the NAcc throughout the entirety of the session. These groups were 
sacrificed 1.5 h later for histological analyses. For data appearing in Fig. 4, mice 
were exposed to a 6 min tail suspension test without light stimulation. An experi- 
menter blind to each mouse condition and light treatment scored all the tail 
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suspension videos by measuring the total time in seconds that each mouse spent 
struggling throughout the protocol. 

Sucrose preference test. A Med Associates operant chamber—equipped with 
photolickometers placed on two separate corners of the chamber—was used to 
count the number of licks made by the mice on lick spouts with direct access to 2% 
sucrose water solution or water alone. All animals undergoing the sucrose pref- 
erence protocol were water-restricted for 36 h before each habituation session. 
These sessions consisted of first plugging the optic fibres on the water-deprived 
mice to a corresponding patch cord and exposing the mice to the operant chamber, 
which contained bottles filled only with water. Each exposure occurred on three 
separate days for 30 min per day. The three habituation sessions occurred inter- 
spersed throughout the 10-day chronic immobilization stress protocol (that is, on 
days 1, 4 and 7 of stress) at least 6 h before or after the stress protocol. In pilot 
experiments, ~90% of water-deprived animals failed to sample both photolick- 
ometers in the operant chamber even after multiple 30-min habituation sessions 
(data not shown); to address this issue, a glove box was inserted on its side in the 
operant chamber such that each subject had a narrow ~10 cm corridor to explore 
and find each lick spout. With this modification, >90% of animals found both lick 
spouts during the first and subsequent habituation sessions. Upon completing a 
habituation session, mice were given water only when 2 h of being placed back into 
the home cage had elapsed. On the test day (that is, the day on which optical 
stimulation occurred), the location of each sucrose or water bottle in the chamber 
was counterbalanced between animal chambers. A 30 min protocol—15 min light 
off, 15 min light on—was used on all animals. The first 15 min were used to detect 
the baseline preference; blue light stimulation at 20 Hz, 15 ms pulse width, ~ 15-20 
mW, occurred during the second 15 min epoch to detect light-induced changes in 
preference. For data appearing in Fig. 3, water-deprived animals were exposed to 
the same 30-min protocol on two separate days. On day 1, after the first 15-min 
epoch, half of the animals received constant green light stimulation at ~15 mW (as 
previously reported**) over the NAcc while simultaneously receiving blue light 
stimulation over the dentate gyrus; the other half received only blue light stimu- 
lation over the dentate gyrus. On day 2, the treatments were reversed in a counter- 
balanced manner. Data was only collected in animals that licked at both spouts in 
the first 15-min interval; animals that did not discover both lick spouts (as evi- 
denced by licking only one spout during the first 15-min interval) were not given 
light stimulation, the experiment was terminated early, and the test was repeated 
the following day. Sucrose preferences were calculated as follows: 


total number of licks to sucrose spout 


total number of licks to sucrose spout +total number of licks to water spout ae 


For the sucrose preference data appearing in Fig. 4, mice were first habituated to 
two water bottles for 2 days in their home cages. On day 3, two water bottles 
containing either 2% sucrose or water were placed into the cages in a counter- 
balanced manner and left undisturbed for 24 h. Sucrose preferences were calcu- 
lated as follows: 


Aweight of sucrose water 


Aweight of sucrose water + Aweight of water aa 


Open field test. An open, metal chamber (Accuscan system) with transparent, 
plastic walls was used for the open field test. Implanted mice were plugged into a 
patch cord, individually placed into the chamber, and allowed to explore freely for 
12 min. An automated video-tracking system (Ethovision by Noldus) was used to 
track the amount of time spent in the centre of the chamber compared to the edges, 
as well as the total distance travelled across a session. Light stimulation, as 
described above, was given during minutes 3-5 and 9-11, inclusive. 

Elevated plus maze test. Implanted animals were plugged into a corresponding 
patch cord before the beginning of the session and subsequently placed in an 
elevated plus maze. Two pieces of plastic (30 cm long, 5 cm wide) formed the 
two arms of the maze that intersected at right angles. One arm was enclosed with 
plastic black walls, and the other arm was open with no walls. The structure was 
elevated 60 cm above the floor and mice were placed one at a time at the inter- 
section of the maze facing into an arm with walls to start a trial. Video tracking 
software (EthoVision by Noldus) was used to track the amount of time the mice 
spent in the enclosed versus the open arms of the maze throughout a 15-min 
session. Optical stimulation occurred only during the second 5-min epoch using 
the same stimulation parameters as noted above. 

Novelty-suppressed feeding. The novelty suppressed feeding paradigm was per- 
formed as previously described’’. In brief, food was removed from the subjects’ 
home cages 24 h before testing. The next day, mice were placed for 10 min in an 
open field apparatus containing bedding with a food pellet at the centre ona 1 cm? 
elevated platform. Light stimulation using the parameters described above 
occurred throughout the entire session. All behaviour was videotaped 
(Ethovision by Noldus) and latency to feed was scored offline by an experimenter 
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blind to the experimental conditions for each mouse. Once placed back into their 
home cages, mice were given a single food pellet, which was weighed before and 
after a 5-min test to measure for motivation/hunger effects on feeding behaviour 
compared to feeding in a novel environment. 

5-day stimulation protocol. For data appearing in Fig. 4, animals were first split 
into six groups: a group in which dentate gyrus cells previously active during a 
positive experience were reactivated twice a day for 5 days (5-day group) after 
the CIS protocol, a group in which such stimulation occurred twice a day for 1 
day (1-day group) after the CIS protocol, a group in which no stimulation was 
delivered (NoStim group) after the CIS protocol, a group in which dentate gyrus 
cells previously active during a neutral experience were reactivated twice a day 
for 5 days (Neutral group) after the CIS protocol, a group that did not receive the 
CIS stress protocol but still had dentate gyrus cells previously active during a 
positive experience reactivated twice a day for 5 days (NoStress group), and 
finally, a group that was exposed to a natural social reward (that is, female 
mouse) twice a day for 5 days (Natural group). Optical stimulation first occurred 
at 10:00 for 15 min (blue laser, 20 Hz, 15 ms pulse width, ~15-20 mW) as 
animals explored an operant chamber, and then again at 15:00 for 15 min using 
the same conditions. The same behavioural schedule was performed for the 
Natural group. All groups were exposed for an equal amount of time to each 
chamber, plugged into a corresponding patch cord, and optical stimulation 
occurred only in the appropriate groups. Each chamber contained dim lighting, 
white plastic floors, and no artificial odorants. One day after the final stimu- 
lation, all groups were exposed to a 6 min tail suspension or 24 h sucrose 
preference test as described above. 

Object-female association. Twenty-four wild-type B6 mice were divided in two 
groups (neutral-object group, that is, control group, and female-object group, that 
is, experimental group (m = 12 per group)). The learning and testing phases were 
conducted on the same day, 6 h apart. In the learning phase, all mice spent 30 min 
in their home cage in the middle of a well-lit room with the lid of the cage and 
metal grid holding water and food removed and a 30 cm tall white rectangular 
frame placed around the home cage to prevent mice from escaping. All the boxes 
contained one target object (counterbalanced objects within and between groups: 
empty methanol bottle or cryostat liquid bottle (sealed)). After 3 min exploring the 
target object, a wild-type female b6 mouse (age 9 to 16 weeks) was introduced in 
the boxes of the experimental mice and remained there for the next 27 min). The 
control mice did not experience a female mouse and only experienced the object. 
After a total of 30 min from the beginning of the learning phase, the object and 
female mouse were removed and the male mice returned to their holding rooms. 
In the testing phase, mice were placed in a rectangular arena (70 X 25 X 30 cm) 
with white floors. A video camera resides above the testing chamber where the 
locations of the subjects were tracked and recorded using Noldus EthoVision XT 
video tracking software. Two zones (left and right) on either end of the box (30 X 
30 cm) as well as a neutral zone in the centre of the box (10 cm) were denoted as 
part of the arena settings. Mice were introduced in the neutral zone of the empty 
arena and allowed to explore freely for 3 min. The tracking software monitored 
which of the two zones each individual mouse preferred. After 3 min, the experi- 
menter introduced two objects (empty methanol bottle or cryostat liquid bottle 
(sealed)) and placed them in the middle of the left and right zones. For each mouse, 
one of the objects was the same as the one experienced during training (target 
object in target zone) and was placed in the least preferred zone. The other object 
was novel (novel side) and placed in the preferred side. During minutes 6 to 9, 
objects were absent from the arena. During minutes 9 to 12, the objects were 
reintroduced in the same positions as minutes 3 to 6. 

Cell counting. The number of mCherry or c-Fos immunoreactive neurons in the 
dentate gyrus and downstream areas were counted to measure the number of 
active cells during defined behavioural tasks in 3-5 coronal slices (spaced 160 
uum from each other) per mouse. Only slices that showed accurate bilateral injec- 
tions in the dentate gyrus were selected for counting. Fluorescence images were 
acquired using a microscope with a X20/0.50 NA objective. All animals were 
sacrificed 90 min post-assay or optical stimulation for immunohistochemical 
analyses. The number of c-Fos-positive cells in a set region of interest (0.5 mm* 
per brain area analysed) were quantified with ImageJ and averaged within each 
animal. Background autofluorescence was accounted for by applying an equal cut- 
off threshold to all images by an experimenter blind to experimental conditions. 
To calculate the percentage of BLA, mPFC, or NAcc cells expressing ArchT-eGFP 
in Fig. 3c, we counted the number of GFP-positive cells and divided by the total 
number of DAPI-positive cells in each region. Statistical chance was calculated by 
multiplying the observed percentage of ArchT-GFP-single-positive cells by the 


observed percentage of c-Fos-single-positive cells; overlaps over chance were cal- 
(SEP xc-Fos* ) 
DAPI 


culated as observed overlap divided by chance overlap: “hance pveilay = 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


A one-way ANOVA followed by Tukey’s multiple comparisons or one-sample 
t-tests were used to analyse data and later graphed using Microsoft Excel with the 
Statplus plug-in or Prism. 

Neurogenesis. After all the behaviour tests, on the 15th day since the first day of 
light stimulation, the mice were overdosed with Avertin and perfused transcar- 
dially with cold phosphate buffer saline (PBS), followed by 4% paraformaldehyde 
(PFA) in PBS. Brains were extracted from the skulls and kept in 4% PFA at 4°C 
overnight. Coronal slices 50-m thick were taken using a vibratome and collected 
in cold PBS. For immunostaining, each slice was placed in PBST (PBS + 0.2% 
Triton X-100) with 5% normal goat serum for 1 h and then incubated with primary 
antibody at 4°C for 24 h (1:250 mouse anti-PSA-NCAM, Millipore; 1:500 dou- 
blecortin, AB2253, Millipore). Slices then underwent three wash steps for 10 min 
each in PBST, followed by a 1 h incubation period with secondary antibody (PSA- 
NCAM: 1:250 AlexaFluor488 anti-mouse, Invitrogen; Doublecortin: 1:300 
A21435, Invitrogen). Slices were then incubated for 15 min with 4’,6-diami- 
dino-2-phenylindole (DAPI; 1:10,000) and underwent three more wash steps of 
10 min each in PBST, followed by mounting and coverslipping on microscope 
slides. Images were taken using a Zeiss Axio Imager2 microscope. PSA-NCAM* 
or doublecortin® cells in the dentate gyrus granule cell layer were counted and 
normalized to the area of the granule cell layer for each brain slice using Image] by 
a researcher blind to the identities of each animal. After all the data were collected, 
the identities of each animal were revealed and the data were assigned back into 
each group for statistical analysis. 

In vivo electrophysiology. As described above, three mice were first bilaterally 
injected with an AAVo-TRE-ChR2-mCherry virus into the dentate gyrus followed 
by lowering a bilateral optic fibre implant into position and cementing it to the 
skull. Following 10 days for recovery and viral expression, in a separate surgery, 
mice were chronically implanted with a hyperdrive that housed six independently 
moveable tetrodes targeting the BLA. To accommodate the optic fibre implant 
cemented on the skull, the AP coordinate for the hyperdrive was adjusted slightly 
(centred at AP = —0.85 mm) and implanted at a ~ 15° angle. The electrical signal 
recorded from the tips of the tetrodes was referenced to a common skull screw over 
the cerebellum and differentially filtered for single unit activity (200 Hz to 8 kHz) 
and local field potentials (1-200 Hz). The amplified signal from each wire is 
digitized at 40 kHz and monitored with an Omniplex sytem (Plexon). Action 
potentials from single neurons were isolated off-line using time-amplitude win- 
dow discrimination through Offline Sorter (Plexon). Putative single units were 
isolated by visualizing combinations of waveform features (square root of the 
power, peak—valley, valley, peak, principal components, and time-stamps) 
extracted from wires composing a single tetrode. The average firing rate for iso- 
lated neurons was 2.25 Hz + 4.14 Hz (mean + s.d; range 0.01-30.15 Hz). 
However, the firing rate distribution was highly rightward skewed (median: 0.81 
Hz) and more than half of the neurons (62%; 66/106) had firing rates under 1 Hz. 
After the last recording session, small lesions were made near the tips of each 
tetrode by passing current (30 j1A for ~ 10s) and mice were transcardially perfused 
and brains extracted for histology using standard procedures. 

Recording and light stimulation protocol. Each mouse had two recording ses- 
sions that occurred on two different days separated by 72 h. Mice were first placed 


into a small recording chamber. In a single recording session, mice were first 
bilaterally stimulated in the dentate gyrus with blue light (450 nm; Doric 
Lenses) for 10s over 15 such trials in total. As a control, the blue light was replaced 
with red light (640 nm; Doric Lenses) and the mice were given twelve 10-s trials 
under this condition. The power output for the blue and red lights emitted from 
the tips of each patch cord was adjusted to 15-18 mW as measured with a standard 
photometer (Thor Labs). The blue and red lasers were powered using a laser diode 
driver (Doric Lenses) triggered by transistor-transistor logic (TTL) pulses emitted 
from a digital I/O card, and these events were also time-stamped and recorded in 
the Omniplex system. The recording session lasted ~20 min and each tetrode was 
lowered ~0.25 mm after the first recording session. 

Electrophysiological data analysis. Spiking activity was analysed using commer- 
cial (Neuroexplorer, NEX Technologies) and custom-made software in Matlab 
(R2014B). To visualize each neuron’s trial-averaged activity for the blue and red 
light stimulation period, a peristimulus time histogram (PSTH) with 100-ms time 
bins was generated with activity time locked to the onset of the blue or red light, 
and then smoothed with a Gaussian kernel (0 = 127 ms). In order to confirm a 
response during blue or red light stimulation period, 99% confidence intervals 
were constructed for the trial-averaged activity using a baseline 2.5 s period of 
spiking activity before the onset of each light under the assumption of Poisson 
spiking statistics (for example, Neuroexplorer, NEX Technologies). A neuron was 
considered to have a response for a particular light stimulation condition if trial 
averaged activity exceeded the upper (excitatory) or lower (inhibitory) bound of 
the 99% confidence interval. We considered neurons activated from dentate 
gyrus stimulation when a neural response was confirmed for the blue light 
condition but not the red light condition. For each neuron identified as such, 
we z-scored neural activity depicted in the blue and red light PSTH, then iden- 
tified the maximum trial-averaged z-score value from the 2.5 s baseline (Pre) and 
during blue or red light stimulation (Post). The Pre and Post maximum z-score 
values for the blue and red light stimulation period was compared using paired 
t-tests (Fig. 2p). 
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Extended Data Figure 1 | Male mice spend more time around an object 
associated with females. Top, Time spent in the target zone where the object 
associated with females is introduced in the ON phases. Female-object paired 
mice (experimental group) spend more time in the target zone during the 
ON phases than the neutral-object paired mice (control group; two-way 
ANOVA with multiple comparisons, ON] tgg = 2.41; P< 0.05, ON2 tgg = 2.08; 
P < 0.05). Bottom, Difference score (average of ON phases — baseline (Bsl)) 
also shows the increased preference for the target zone in the female-object 
group compared to neutral-object group (t2 = 2.37; *P < 0.05). n = 12 per 
group. See the Methods section for detailed methods. 
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Extended Data Figure 2 | Positive, neutral, or negative experiences label 
a similar proportion of dentate gyrus cells with ChR2; stress prevents 

weight gain over 10 days. a, The c-Fos mice were bilaterally injected with 
AAV -TRE-ChR2-mCherry and implanted with optical fibres targeting the 
dentate gyrus. b-e, Histological quantifications reveal that, while off Dox, a 
similar proportion of dentate gyrus cells are labelled by ChR2-mCherry in 


response to a positive (c), neutral (d), or negative (e) experience. All animals 


were sacrificed a day after completing the CIS protocol. One-way ANOVA 
followed by Bonferroni post hoc test, P > 0.05, n.s., not significant. f, Animals 
were chronically immobilized for 10 days, during which they lost a significant 
amount of weight compared to an unstressed group (one-way ANOVA 
followed by Bonferroni post hoc test, *P < 0.05, n = 9 per group). Data are 
means + s.e.m. 


©2015 Macmillan Publishers Limited. All rights reserved 


me No Stress + Positive Memory (ChR2) 
we No Stress + mCherry 
M@® Stress + Positive Memory (ChR2) 


~ Stress + Neutral Memory (ChR2) 
MM Stress + mCherry 
MM Stress + Negative Memory (ChR2) 


2K 


Latency to Feed (s) 


Light always on 


Extended Data Figure 3 | Reactivation of a positive memory decreases 
latency to feed in a novelty suppressed feeding paradigm. a, All groups were 
food deprived for 24 h and then underwent a novelty-suppressed feeding 
protocol. While chronic immobilization increased the latency to feed, light- 
reactivation of a positive memory significantly decreased the latency to feed 
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at levels that matched the unstressed groups. b, Upon completion of the 
novelty suppressed feeding test, all groups were returned to their home cage 
and food intake was measured after 5 min (one-way ANOVA followed 

by Bonferroni post hoc test, **P < 0.01, n = 16 per group). Data are 
means = s.e.m. 
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Extended Data Figure 4 | Activation of a positive memory elicits BLA 
spiking activity, requires NAcc glutamatergic activity in the tail suspension 
test, but does not alter locomotor activity in the open field test. a, Raster 
plots and peri-stimulus time histograms (PSTH) illustrating a transient 
excitatory response from a single BLA neuron out of the nine neurons 
responsive to dentate gyrus positive memory activation during 10 s of blue light 
stimulation, but not in response to 10 s of red light as a control. Blue bar plots on 
the right illustrate maximum BLA neural firing rate before (Pre) and after 
(Post) blue light stimulation in the dentate gyrus (paired t-test, tj = 6.91, 

*P = 0.023). Red bar plots show the maximum neural activity for the same 
neurons after red light stimulation in the dentate gyrus that serves as a control 
(paired t-test, ty = 1.62, P = 0.15). b, Brain diagram illustrating target areas 


manipulated. Within-subjects experiments revealed that glutamatergic 
antagonists (Glux), but not saline, in the accumbens shell blocked the light- 
induced effects of a positive memory in stressed subjects. For behavioural data, 
a two-way ANOVA with repeated measures followed by a Bonferroni post hoc 
test revealed a group-by-light epoch interaction on day 1 (F\,90 = 28.39, 

P < 0.001; n = 16 per group) and day 2 of testing (F,99 = 8.28, P < 0.01). 
Data are means ~ s.e.m. ¢, All groups failed to show significant changes in 
locomotor activity within a session of open field exploration during either 
light off or light on epochs, though any trends towards decreases in 
locomotion are consistent with stress-induced behavioural impairments. 

*P < 0.05, **P < 0.01, ***P < 0.001. 
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Extended Data Figure 5 | Activating a positive memory in the dentate gyrus 
produces an increase in c-Fos expression in the lateral septum and 
hypothalamus, but not the lateral habenula, ventral hippocampus, or VTA. 
a, Diagram of regions analysed. b, c, c-Fos expression significantly increased in 
the lateral septum (b) and subregions of the hypothalamus including the 
dorsomedial (DM), ventromedial (VM), and lateral hypothalamus (c) in the 
positive memory group but not in a group in which a neutral memory was 


stimulated or in a group expressing mCherry alone. c-e, c-Fos expression did 
not significantly increase in the lateral habenula (c), various ventral 
hippocampus subregions (d), or VTA, identified by tyrosine hydroxylase 
staining (red) stainings in the images expanded on the right (e) (one-way 
ANOVA followed by Bonferroni post hoc test *P < 0.05, **P < 0.01,n =5 
animals per group, 3-5 slices per animal). TS, tail suspension. Data are 
means + s.e.m. 
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Extended Data Figure 6 | Activating a positive memory through the dentate 
gyrus of unstressed animals increases c-Fos expression in various 
downstream regions. a, Diagram of regions analysed. b, In the positive 
compared to the neutral memory group, c-Fos expression is significantly 
increased in the lateral septum, NAcc shell, BLA, dorsomedial, ventromedial 
and lateral hypothalamus, but not in the mPFC, NAcc core, habenula, or 
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ventral hippocampus. Trends were observed in the central amygdala (CeA) 
and VTA. Each brain region was analysed using an unpaired Student’s f test, 
n = 5 animals per group, 3-5 slices per animal; #P = 0.17 for central amygdala 
and P = 0.09 for VTA; *P < 0.05, **P < 0.01, ***P < 0.001, n.s., not 
significant. Data are means + s.e.m. 
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Extended Data Figure 7 | Dopamine receptor antagonists block the light- 
induced effects of positive memory activation; a single session of activating a 
positive memory in the dentate gyrus does not produce long-lasting 
antidepressant-like effects. a, Administration of a cocktail of dopamine 
receptor antagonists (DAx) prevented the light-induced increases in struggling 
during the tail suspension test. When animals were tested again on day 2 and 
infused with saline, the behavioural effects of optically reactivating a positive 
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memory were observed (two-way ANOVA with repeated measures followed 
by Bonferroni post hoc test, *P < 0.05, n = 9 per group). b, Animals in which 
a positive memory was optically activated during the tail suspension test or 
sucrose preference test showed acute increases in time struggling or preference 
for sucrose; this change in behaviour did not persist when tested again on 
day 2 (within subjects ANOVA followed by Bonferroni post hoc test), 

n = 9. n.s., not significant. Data are means + s.e.m. 
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Extended Data Figure 8 | Chronic activation of a positive memory prevents dentate gyrus for the 5-day (b), 1-day (c), neutral (d), no stimulation (e), 


stress-induced decreases in neurogenesis. a, The 5-day positive memory natural (f), and no stress (g) groups. h-m, Representative PSA-NCAM images 
stimulation group showed a significant increase of adult newborn cells in the corresponding to data appearing in Fig. 4d. n = 5 slices per animal, 13 animals 
dentate gyrus as measured by doublecortin (DCX)-positive cells (one-way per group for data appearing in a. *P < 0.05, n.s., not significant. Data are 


ANOVA followed by Bonferroni post hoc test, F;.72 = 7.634, P< 0.01) relative means + s.e.m. 
to control groups. b-g, Representative images of DCX-positive cells in the 
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Extended Data Figure 9 | Behavioural and neuronal correlations. 
a, Performance levels in the SPT and the number of adult-born neurons as 
measured by PSA-NCAM are positively correlated on an animal-by-animal 
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basis. b, Performance levels between the TST and SPT show strong positive 
correlation trends on an animal-by-animal basis. n = 14 per TST behavioural 
group, n = 15 per SPT behavioural group. 
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a-Synuclein strains cause distinct synucleinopathies 
after local and systemic administration 


W. Peelaerts', L. Bousset?, A. Van der Perren’, A. Moskalyuk?, R. Pulizzi®, M. Giugliano®*+>®, C. Van den Haute!’, 


R. Melki? & V. Baekelandt! 


Misfolded protein aggregates represent a continuum with overlap- 
ping features in neurodegenerative diseases, but differences in 
protein components and affected brain regions’. The molecular 
hallmark of synucleinopathies such as Parkinson’s disease, demen- 
tia with Lewy bodies and multiple system atrophy are megadalton 
a-synuclein-rich deposits suggestive of one molecular event caus- 
ing distinct disease phenotypes. Glial a-synuclein (a-SYN) fila- 
mentous deposits are prominent in multiple system atrophy and 
neuronal o-SYN inclusions are found in Parkinson’s disease and 
dementia with Lewy bodies”. The discovery of a-SYN assemblies 
with different structural characteristics or ‘strains’ has led to the 
hypothesis that strains could account for the different clinico- 
pathological traits within synucleinopathies**. In this study we 
show that a-SYN strain conformation and seeding propensity lead 
to distinct histopathological and behavioural phenotypes. We 
assess the properties of structurally well-defined a-SYN assemblies 
(oligomers, ribbons and fibrils) after injection in rat brain. We 
prove that a-SYN strains amplify in vivo. Fibrils seem to be the 
major toxic strain, resulting in progressive motor impairment and 
cell death, whereas ribbons cause a distinct histopathological 
phenotype displaying Parkinson’s disease and multiple system 
atrophy traits. Additionally, we show that a-SYN assemblies cross 
the blood-brain barrier and distribute to the central nervous 
system after intravenous injection. Our results demonstrate that 
distinct a-SYN strains display differential seeding capacities, indu- 
cing strain-specific pathology and neurotoxic phenotypes. 

The discovery that Lewy bodies spread within the nervous system** 
led to the demonstration that fibrillar assemblies seed soluble «-SYN 
and spread between cells in cellular and animal models’. The most 
widely accepted paradigm postulates that pre-fibrillar oligomers, 
as opposed to mature fibrils, represent the neurotoxic entities in 
Parkinson’s disease*’. Although a direct quantitative in vivo compar- 
ison of the neurotoxic potential of different u-SYN assemblies is 
lacking, recent evidence indicates that fibrillar a-SYN toxicity is sig- 
nificantly greater than that of pre-fibrillar precursors*'°. Studies with 
inoculation of recombinant fibrils or pathogenic brain lysates in rodent 
brain triggered «-SYN pathology with varying efficiencies and mod- 
erate neuronal loss'!’”. In contrast, recombinant adeno-associated viral 
vector-mediated (rAAV) overexpression of «-SYN in rodent and prim- 
ate brain induces time-dependent robust behavioural impairment and 
dopaminergic neurodegeneration’*™. In this study, we compared the 
in vivo properties of «-SYN oligomers and two distinct «-SYN strains 
(‘fibrils’ and ‘ribbons’) (Extended Data Fig. la-e)*, by inoculation of 
highly purified and structurally characterized recombinant human 
a-SYN preparations in rat substantia nigra in absence and presence 
of rAAV-mediated human o-SYN overexpression. 

To assess «-SYN uptake and spreading in a detailed and quantitative 
way, we fluorescently labelled all a-SYN assemblies (Extended Data 


Fig. 1f-h). First, we applied ex vivo fluorescent tomography to visualize 
spreading of «-SYN in whole brain after inoculating 10 yg of a-SYN 
assemblies (Fig. la and Supplementary Table 1). Reconstructed images 
of the fluorescence volume after 20 min and 7 days showed that «-SYN 
oligomers spread more efficiently than fibrils, ribbons and free fluor- 
ophore (Fig. 1b). Subsequent histological analysis revealed that all 
assemblies were taken up in dopaminergic neurons, spread over the 
arborizing striatal dopaminergic axons and were transmitted trans- 
synaptically in a time-dependent manner (Extended Data Fig. 2a-g). 
These results build upon previous findings that the «-SYN oligomers 
disseminate more efficiently than higher molecular weight assemblies 
after intracerebral inoculation’*”*. 

The reason why certain Lewy body-bearing regions are subject 
to neurodegeneration and others not, remains incompletely under- 
stood. The fact that patients with Parkinson’s disease, dementia with 
Lewy bodies, and multiple system atrophy present variable striatoni- 
gral pathology is also confounding. We therefore assessed whether 
a-SYN oligomers, fibrils and ribbons translate their distinctive 
structural and biochemical properties into motor behavioural and 
neurotoxic phenotypes. Four months after nigral injection of ribbons 
or fibrils, Lewy body- and Lewy neurite-like inclusions formed essen- 
tially in dopaminergic neurons. These juxta-, peri-, nuclear and fila- 
mentous inclusions stained positive for «a-SYN phosphorylated at 
Ser129 (Pa-SYN), resembling the situation in Lewy body/Lewy neurite 
pathology-associated inclusions (Fig. 1c-f and Extended Data Fig. 3). 
Pa-SYN inclusions were considerably more abundant for a-SYN 
ribbons compared to fibrils with a predominant Lewy neurite-like 
filamentous phenotype staining positive for the aggresome marker 
SQSTM1I (also known as p62; Fig. leg). Injection of «-SYN oligomers 
and brain homogenate from aged 18-month transgenic mouse expres- 
sing mutant human a-SYN(A30P) did not result in detectable Pa-SYN 
accumulation. Combined rAAV-mediated «-SYN overexpression 
with ribbons and fibrils inoculation lead to increased Pa-SYN” cells 
(Extended Data Fig. 4). Remarkably, and only after combined rAAV- 
mediated «-SYN overexpression with a-SYN ribbons inoculation, a 
second but sparse o-SYN phosphorylation pattern was observed in 
oligodendroglial cells (Fig. 1h, i). 

In terms of neurotoxicity, nigral inoculation of brain homogenate, 
oligomers, fibrils and ribbons did not induce detectable dopaminergic 
cell death until the final time point of 120 days (Fig. 2a, b). In contrast, 
sole rAAV-mediated «-SYN overexpression (control) led to 27% cell 
loss of tyrosine hydroxylase-positive neurons and 32% striatal dopa- 
minergic axonal loss (Fig. 2c, d). Combined rAAV-a-SYN overexpres- 
sion with inoculation of brain homogenate, «-SYN oligomers, fibrils or 
ribbons accelerated tyrosine hydroxylase-positive cell death (Fig. 2c-e) 
and severely reduced striatal dopaminergic nerve terminal volume by 
61% and 43% for o-SYN fibrils and brain homogenate, respectively. 
Combined rAAV-a-SYN overexpression and inoculation of «-SYN 
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Figure 1 | a-SYN assemblies spread, but only strains cause Lewy-like 
pathology after injection in rat substantia nigra. a, Ex vivo macroscopic 
imaging of atto-550 labelled o-SYN after nigral inoculation (red injection spot) 
by reconstructing 11 separate excitation spots into a macroscopic 3D map 
allowing the visualization of «-SYN spread. Representative image of fluores- 
cently labelled oligomeric «-SYN distribution after 20 min and 7 days is shown. 
b, Quantification of change in a-SYN fluorescent volume (AFV) between 

20 min and 7 days as a measure for o-SYN spreading. Oligomers show a 
significant increase in spreading (1.78 + 0.44 s.e.m.) compared to free 

dye (—0.34 + 0.06 s.e.m.) and a-SYN fibrils (0.40 + 0.21 s.e.m.) (**P < 0.01, 
*P < 0.05 for one-way ANOVA with Tukey’s post-hoc analysis, n = 3). c, After 
4 months Lewy-like pathology is observed in dopaminergic neurons after 
inoculation of «-SYN strains (fibrils and ribbons). d, Schematic overview of 
Lewy body- and Lewy neurite-like inclusions in substantia nigra for ribbons 
and fibrils. e, Lewy neurite-like threads and club-shaped inclusions (upper 
and lower panel, respectively) show abundant inclusions for ribbons quantified 
in f (***P < 0.001 for one-way ANOVA with Tukey’s post-hoc analysis, 

n = 4). g-i, Confocal images with z-axis show phosphorylated inclusions in 
dopaminergic neurons colocalizing with the ubiquitin binding protein p62 
(g), whereas only ribbons result in GCI’s in the presence of elevated «-SYN 
levels after combined rAAV-driven o-SYN overexpression (h), further 
indicated by the oligodendroglial lineage marker Olig2 (i). Scale bars, 10 jum. 
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oligomers and ribbons did not yield additional tyrosine hydroxylase 
fibre loss (Fig. 2d). Assessment of spontaneous forepaw use with the 
cylinder test revealed a consistent significant motor deficit from 
60 days onwards upon injection of «-SYN fibrils (Fig. 2f). Combined 
rAAV-mediated o-SYN overexpression with «-SYN assemblies inocu- 
lation further aggravated motor deficits (Fig. 2g). These findings 
demonstrate that introduction of seeds under conditions where 
a-SYN is expressed at higher levels greatly enhances neurodegenera- 
tion in a strain-dependent manner. Thus, although ribbon inoculation 
induced more pronounced Lewy body/Lewy neurite-like inclusions 
(Fig. 1c-f), fibrils imposed the largest neurotoxic burden on the stria- 
tonigral pathway (Fig. 2h). 

The observation of a behavioural phenotype after inoculation of 
pure a-SYN strains in the absence of dopaminergic degeneration 
indicated that this functional deficit was not a downstream effect of 
dopaminergic cell death. Recent studies in cellular models, «-SYN 
transgenic animals and patients with incidental Lewy body disease, 
Parkinson’s disease and dementia with Lewy bodies have shown neur- 
onal dysfunction and motor abnormalities before overt neurodegen- 
eration and Lewy body/Lewy neurite pathology”. We therefore 
employed cellular electrophysiology to study the impact of a-SYN 
assemblies on neuronal and synaptic physiology. Endogenous 
a/B-SYN is calculated to be over 40 1M within the rat synaptosome”’. 
Four days exposure of rat primary cortical cultures to 1 uM of a-SYN 
assemblies left passive and excitable electrical membrane properties 
unaltered in single neurons, as recorded intracellularly by patch-clamp 
(Extended Data Fig. 5a-e). Instead, significant effects were observed 
on synaptic physiology. The frequency of occurrence of spontaneous 
bursts of action potentials revealed a direct substantial decrease after 
exposure to o&-SYN assemblies (Fig. 2i, j). Further evidence was 
obtained via substrate-integrated microelectrode arrays (MEAs) 
allowing simultaneous extracellular monitoring of the spontaneous 
neuronal network electrical activity from 59 independent microelec- 
trodes. The spontaneous firing of cortical neurons self-organizes 
ex vivo as episodic network-wide epochs of action potential synchrony, 
offering an indirect indicator of (dys)functional glutamatergic synaptic 
transmission. Exposure to low concentrations of different «-SYN 
assemblies revealed an increase in the average inter-spike intervals, 
which was significant for fibrils and oligomers (Fig. 2k, 1). 

Taken together, these data indicate that different «-SYN assemblies 
can affect neurotransmission after acute exposure, but only fibrillar 
a-SYN exhibited perpetual behavioural and aggravated neurotoxic 
phenotypes in vivo. To further investigate this, we assessed the capa- 
cities of «-SYN oligomers, ribbons and fibrils to seed endogenous 
a-SYN and to propagate, leading to long-term functional effects. We 
inoculated 40 jg of «-SYN assemblies divided over 3 injection sites in 
rat striatum in combination with rAAV-a-SYN overexpression 
(Fig. 3a) or not (Extended Data Fig. 6). Large amounts of sarkosyl- 
insoluble rodent «-SYN were detected after 60 days throughout the 
striatonigral pathway for ribbons and fibrils (Fig. 3a and Extended 
Data Fig. 6), although the inoculated human «-SYN was undetected 
after sarkosyl extraction in animals injected only with a-SYN assem- 
blies (Extended Data Fig. 7). This implies that exogenous o-SYN 
strains seed the assembly of endogenous a-SYN, overcoming the 
species barrier. In addition, the proteinase K resistance of sarkosyl- 
insoluble «-SYN extracted from the brains of animals overexpressing 
a-SYN inoculated with either ribbons or fibrils differed (Fig. 3b). 
Fibrils exhibited a higher resistance compared to ribbons, reflecting 
the capacity of each strain to amplify in a specific manner in vivo 
(Fig. 3b and Extended Data Figs 6 and 7). Hence, fibrils possess the 
highest capacity to translate their neuropathological potential from a 
defined structural to a wide spatiotemporal level. 

a-SYN knockout animals show no robust pathology, possibly 
because of functional redundancy (f- and y-SYN isoforms)’'. The 
remarkable capacity of «-SYN strains to persist and act as seeds 
by imprinting their intrinsic structures, causing synaptic defects, is 
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Figure 2 | a-SYN fibrils are the major toxic strain in vivo and cause synaptic 
impairment. a, b, Stereological quantification of tyrosine hydroxylase-positive 
(TH*) cells and striatal TH volume shows no significant cell or axonal loss 
after nigral injection of different «-SYN assemblies. c, d, Significant loss of 
dopaminergic cells and projecting axons is observed upon inoculation of 
fibrils and brain homogenate with rAAV-driven a-SYN overexpression 
(one-way ANOVA with Tukey’s post-hoc analysis versus contralateral side 
*EP < 0.001, **P < 0.01, *P < 0.05; or Dunnet’s post-hoc analysis versus 
control condition +P < 0.05; s.e.m, n = 4). e, Representative image of TH 
staining in substantia nigra and striatum after co-injection of o-SYN fibrils and 
rAAV-driven «-SYN overexpression after 4 months. Scale bar, 1,000 tum. 

f, Impaired motor behaviour after inoculation of «-SYN fibrils (**P < 0.01, 
*P < 0.05, one-sample t-test for a hypothetical value of 50% with Bonferroni 
post-hoc analysis; +P < 0.05 for one-way ANOVA with Tukey’s post-hoc 
analysis; s.e.m, n = 4) and g, increased decline of motor behaviour after 
combined o-SYN assemblies inoculation and rAAV-driven «-SYN 


a Striatum 


Figure 3 | a-SYN assemblies propagate in a strain-dependent manner. 

a, Sarkosyl-insoluble «-SYN is detected in striatum and substantia nigra after 
striatal inoculation of brain homogenate, oligomers, ribbons and fibrils in 
combination with rAAV-driven «-SYN overexpression. b, Degradation of 
sarkosyl-insoluble -SYN in brain extracts from fibrils and ribbons inoculated 
animals with rAAV-driven o-SYN overexpression by increasing proteinase 
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overexpression. h, Schematic overview indicating neurotoxic effects induced 
by different «-SYN strains with fibrils exhibiting the largest neurotoxic 
potential. Red plus (+) indicates data obtained upon rAAV-driven «-SYN 
overexpression. i-1, Reduced spontaneous collective firing in 4-week-old 
primary rat cortical neurons indicates synaptic impairment. i, Whole-cell 
patch-clamp intracellular recordings in single neurons revealed a progressive 
reduction of the rate of spontaneous emission of APs, quantified in j (*P < 0.05, 
**P < 0.01, Wilcoxon rank sum test, n = 32 cells). k, Representative raster plot 
of spontaneous action potential firing and the effect of fibrils on network-wide 
spontaneous episodic synchronization of neuronal action potentials. 

1, Significant lengthening of the average inter-spike intervals during the 
episodes of synchronization was observed across increasing concentrations of 
fibrils and oligomers (n = 9 cultures). No significant changes occurred during 
the entire incubation time in the total number of microelectrodes detecting 
spontaneous action potentials, indicating normal cell viability. 


Fibrils 
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K concentrations. The results reveal that each strain imprints its intrinsic 
structure to endogenous «-SYN. Western blot analysis was performed using 
the monoclonal anti-«-SYN antibody clone 42. The data correspond to the 
mean and associated standard error calculated from 3 independent 
measurements from 2 biological replicates. 
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Figure 4 | a-SYN strains cross the blood-brain barrier after systemic 
administration. a, «-SYN oligomers, ribbons and fibrils were intravenously 
injected every two weeks during 4 months in rat tail vein. b, Epifluorescent 
imaging reveals a fluorescent atto-647 signal originating from the brain region 
for a-SYN oligomers, ribbons and fibrils 4 months after the initial injection. 
c, Fluorescent yield is significantly higher for ribbons and oligomers compared 
to control conditions (**P < 0.01, *P < 0.05, one-way ANOVA with Dunnet’s 
post-hoc analysis versus control condition). d, Overview panel of human 


evidence for a gain-of-toxic function. a-SYN fibrils can multiply 
by simple elongation with addition of monomers at their ends or 
by breakage creating additional nuclei”. This growth process 
appears strain-dependent, making strains the crucial pathogenic 
factor. Indeed, several months after nigral injection and in sharp con- 
trast to all other assemblies, «-SYN fibrils were detectable in striatal 
dopaminergic axons (Extended Data Fig. 8). Therefore, and although 
all «-SYN assemblies might have a role during pathogenesis, fibrillar 
a-SYN exhibited the largest neurotoxic potential accompanied by 
disease-specific hallmarks at long-term (overview in Supplementary 
Table 2). 

In early stages of neurodegeneration pathological changes are often 
localized in defined brain areas, whereas at later stages neuronal dys- 
function and associated protein aggregates are widespread. Oligomeric 
prion proteins invade the central nervous system acutely after crossing 
the blood-brain barrier or chronically via the splenic-nerve route”. 
To assess whether «-SYN assemblies cross the blood-brain barrier, or 
behave in a similar manner, we repeatedly injected «-SYN oligomers, 
ribbons and fibrils intravenously on a two-weekly basis during 4 
months using two fluorescent labels, providing additional temporal 
resolution (Fig. 4a). No rAAV-a-SYN was used in these experiments. 
The red-shifted fluorescent label allowed epifluorescence imaging of 
the injected «-SYN strains in live animals. At the final time point we 
observed a striking, localized fluorescent signal originating from the 
brain for «-SYN oligomers, ribbons and fibrils, but not for control 
animals injected with saline or the free atto-647 label (Fig. 4b, c). 
Immunohistochemical staining revealed o-SYN deposits in the central 
nervous system for ribbons and fibrils (Fig. 4d, e and Extended Data 
Fig. 9) but only diffuse staining for oligomers (Fig. 4d, Extended Data 
Fig. 10 a-f). This could indicate that «-SYN oligomers might either 
diffuse to a higher extent because of their molecular mass or are partly 
washed out during perfusion, while «-SYN fibrils and ribbons remain 
in place after crossing the blood-brain barrier. Detailed histological 
analysis showed an accumulation of atto-550- and atto-647-labelled 
human o-SYN fibrils in cortical neurons (Fig. 4f) and in spinal cord 
(Fig. 4g). This pattern was accompanied with microglial activation, 
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a-SYN deposits and microglial activation (Mac1/CD11b staining) in thoracic 
spinal segment at low and high magnification. e, Quantification of a-SYN 
density in the spinal cord (cervical, thoracic and lumbosacral segments) shows 
a strong correlation between o-SYN density and microglial activation. 

f, Intravenous injection of «-SYN fibrils for 4 months results in 7-SYN 
inclusions in cortical neurons labelled with the neuron-specific marker BIII- 
tubulin. g, Confocal analysis of spinal cord reveals human «-SYN deposits 
labelled with the atto-550 and atto-647 markers. Scale bars, 20 um. 


which was most pronounced for fibrils throughout the spinal cord, 
and with redistribution of the early neuronal injury marker GAP43 
(Fig. 4d, e and Extended Data Fig. 10g). These observations dem- 
onstrate that, in this experimental setup, the spread of o-SYN 
assemblies is not restricted to the intracerebral route but extends 
beyond the central nervous system. 

The findings that «-SYN spread after intramuscular and gastric 
injection**”*, and the demonstration of systemic spread we make here 
are striking. The contribution of these routes to «-SYN assemblies 
spread and associated pathology deserves further investigation. In 
contrast to rapidly transmitted oligomers, the in vivo behaviour of 
slowly amplifying «-SYN strains may provide a basis for the observed 
heterogeneity in synucleinopathies and its slow but relentless pattern 
of spreading. This underlines the need for therapeutic strategies aimed 
at changing the surface properties of «-SYN assemblies either using 
epitope- or strain-specific antibodies or molecular chaperone-derived 
peptides” to interfere with the propagation of disease-associated 
a-SYN assemblies. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Production and purification of recombinant a-SYN and rAAV-a-SYN. 
Recombinant o-SYN production was performed as previously described’. 
Vector production and purification was performed as previously described”*. 
The plasmids include the constructs for the AAV2/7 serotype, the AAV transfer 
plasmid encoding the human A53T mutant o-synuclein transgene under the 
control of the ubiquitous CMVie enhanced synapsinl promoter and the 
pAdvDeltaF6 adenoviral helper plasmid. Real-time PCR analysis was used for 
genomic copy determination. 

Endotoxin detection. Protein endotoxin detection was performed using LAL 
Chromogenic Endotoxin Quantitation Kit (Thermofisher catalogue no. 88282) 
following the manufacturer’s instruction. Briefly, 10 tg of «-SYN monomers, 
fibrils, ribbons and unbound Atto-550 were dissolved in 50 ll of endotoxin-free 
water provided in the kit and distributed in a 96-well microplate. The LAL reagent 
was added next and the plate was incubated 10 min at 37 °C in a thermomixer 
(Eppendorf, Germany). The chromogenic reagent was next added and the incuba- 
tion further extended 6 min. The reaction was arrested with 25% acetic acid. 
Absorbance at 405 nm was measured using a Flexstation microplate reader 
(FlexStation, Molecular Devices). Protein endotoxin units are presented in 
Extended Data Fig. 1. 

rAAV-o-SYN vector endotoxin detection was performed using LAL bacterial 
endotoxin test (Endosafe- PTSTM, Charles River) following the manufacturer 
instruction. Briefly, rAAV o-SYN vector was loaded in the sample reservoir, mixed 
with the LAL reagent and combined with the chromogenic substrate. The Optical 
density was measured and analysed against an internally achieved standard curve. 
The measurements were performed in quadruple. Endotoxin units were deter- 
mined to be 0.0000114 endotoxin unit per injection. 
Generation and labelling of a-SYN assemblies. «-SYN fibrils and ribbons were 
generated as previously described’. Oligomeric «-SYN was generated by incub- 
ating monomeric a-SYN 4 °C for 7 days. Oligomeric «-SYN was separated from 
monomeric o-SYN by size exclusion chromatography using a Superose6 HR10/30 
column (GE Healthcare) equilibrated in phosphate buffered saline (PBS) buffer. 

a-SYN assemblies in PBS were labelled by addition of 2 molar excess of the 
aminoreactive fluorescent dye atto-550 or atto-647 (ATTO-Tech GmbH). 
Labelling was performed following the manufacturer’s recommendations. 
Unreacted dye was removed by size exclusion chromatography or three cycles 
of sedimentation and suspension in PBS for oligomers or fibrils and ribbons, 
respectively. The amount of incorporated dye was assessed by mass spectrometry. 
The samples were de-salted (with 5% acetonitrile, 0.1% trifluoroacetic acid (TFA)) 
and eluted from a C18 reversed-phase Zip-Tip (Millipore, Billerica, MA, USA) 
in 50% acetonitrile, 0.1% TFA. Peptide samples were mixed in a ratio of 1:5 to 1:20 
(v/v) with sinapinic acid (10 mg ml‘) in 50% acetonitrile and 0.1% TFA) and 
spotted (0.5 pl) on a stainless steel MALDI target (Opti-TOF; Applied 
BioSystems). MALDI-TOF-TOF MS spectra were acquired with a MALDI- 
TOF-TOF 5800 mass spectrometer (Applied Biosystems) using linear mode 
acquisition. External calibration was performed using unmodified WT «-SYN. 
Acquisition and data analysis were performed using the Data Explorer software 
from Applied Biosystems. 

The nature of all «-SYN assemblies used was routinely assessed using a Jeol 1400 
(Jeol Ltd, Peabody, MA) Transmission Electron Microscope (TEM) after adsorp- 
tion of the samples onto carbon-coated 200-mesh grids and negative staining with 
1% uranyl acetate. The images were acquired with a Gatan Orius CCD camera 
(Gatan). 

Stereotactic injections. All animal experiments were carried out in accordance 
with the European Communities Council Directive of 24 November 1986 (86/609/ 
EEC) and approved by the Bioethical Committee of the KU Leuven (Belgium). 
Young adult female Wistar rats (Janvier, France) weighing about 200-250 g were 
housed under a normal 12 h light/dark cycle with free access to pelleted food and 
tap water. All surgical procedures were performed using aseptic techniques and 
ketamine (60 mg per kg intraperitoneally, Ketalar, Pfizer, Belgium) and medeto- 
midine (0.4 mg per kg, Dormitor, Pfizer) anaesthesia. Following anaesthesia the 
rodents were placed in a stereotactic head frame (Stoelting, IL, USA). Injections 
were performed with a 30-gauge needle and a 10-1 Hamilton syringe. After 
making a midline incision of the scalp, a burr hole was drilled in the appropriate 
location for the substantia nigra (single injections) or striatum (triple injections) at 
the right site of the skull. Stereotactic coordinates, dose and concentration of 
a-SYN protein and rAAV-a-SYN vectors are described in Supplementary Table 
1. Brain homogenate was extracted from 18 month old Thy-1 A30P transgenic 
mice” as described previously” adjusted to a total amount of 10 1g. Supplemented 
viral titres were diluted to 3.0 X 10'° genome copies per ml A53T «-SYN rAAV2/7. 
Fluorescent imaging. Epifluorescent images and fluorescent tomography 
were performed using the IVIS Spectrum in vivo imaging system (Caliper, 
PerkinElmer, version 4.3.1). Anaesthesia was induced in an induction chamber 
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with 3% isoflurane in 100% oxygen at a flow rate of 11 min’ * and maintained in 
the IVIS with a 1.5% mixture at 0.51 min” '. Rats were injected with 500 ll saline 
supplemented with 10 jg of fluorescently labelled «-SYN protein intravenously 
in the tail vein. Subsequently, they were placed in the prone position in the IVIS 
and epifluorescent images of atto-647 0-SYN labelled proteins were acquired at 
the final time point. Atto-550 o-SYN labelled proteins could not be imaged due 
to high autofluorescent background levels in live animals. Each frame depicts 
the fluorescent image as a pseudocolour image superimposed on the grey-scale 
photographic image. The data are reported on a logarithmic scale as the fluor- 
escent yield or fluorescence emission radiance per incident excitation intensity 
(ps cm? sr' pW! cm”) from a 2 cm” circular region of interest around 
the head. 

Fluorescent tomography ex vivo images of «-SYN spreading were acquired by 
merging 11 distinct and predefined transillumination excitation sources of per- 
fused and fixed rat brain. A small illumination spot is scanned over a predefined 
grid resulting in distinct illumination foci. The excitation source was positioned 
below the brain with the delivery of a concentrated excitation light beam and 
observed by a detector located above the object. Superimposing surface topo- 
graphy and fluorescent signal results in fluorescent tomography providing a 3D 
reconstruction of atto-550-tagged «-SYN fluorescent volume. Emitted fluorescent 
light propagation is calculated through diffusion approximation allowing the 
reconstruction of the fluorescent source in 3D. 3D reconstruction is presented 
as voxels with fluorescent intensity profile for each condition and is mapped to a 
3D brain atlas. Fluorescence yield is expressed in units of pmol M7! cm™!. A 
threshold % value was set to keep the minimum count of pixels above 200 to 
exclude background pixels from the reconstruction. The accuracy of reconstruc- 
tion was checked for all samples by comparing the measured light diffusion pattern 
with the simulated light diffusion pattern. If the % error between these two has a 
near-zero % difference, the reconstruction was considered successful and used for 
spatial measurements. 

Behavioural testing. The cylinder test was used to measure asymmetry in spon- 
taneous forelimb use. Contacts made by each forepaw with the wall of 20 cm-wide 
clear glass cylinder were scored from the videotapes by an observer blinded to the 
animal’s identity. Between 20 and 30 wall touches per animal (contacts with fully 
extended digits executed with the forelimb ipsilateral and contralateral to the 
lesion) were counted. The number of impaired forelimb contacts was expressed 
as a percentage of total forelimb contacts. Non-lesioned control rats should score 
around 50% in this test. No habituation of the animals to the testing cylinder was 
allowed before video recording. An investigator blind to different groups per- 
formed all the analyses. 

Histology. Rats were euthanized with an overdose of sodium pentobarbital (60 mg 
per kg intraperitoneally, Nembutal, Ceva Santé, Belgium) followed by intracardial 
perfusion with 4% paraformaldehyde in PBS. After postfixation overnight, 50 um 
thick coronal brain sections were made with a vibrating microtome (HM 650V, 
Microm, Germany). IHC was performed as previously described*! using antibod- 
ies summarized in Supplementary Table 3. For spinal cord staining, commercial 
TNB blocking buffer (Perkin Elmer) was added to all incubation steps to minimize 
background staining in 50 jum vibratome sections. 

For fluorescent double or triple staining, sections were rinsed three times in PBS 

and incubated overnight in the dark in PBS-0.1% Triton X-100 and 0.1% sodium 
azide, 10% donkey or goat serum and antibodies summarized in Supplementary 
Table 3. After rinsing, the sections were incubated in the dark for 2h in fluoro- 
chrome- conjugated secondary antibodies. After being rinsed and mounted, the 
sections were coverslipped with Mowiol. Fluorescent staining was visualized by 
confocal microscopy (Fluoview 1000, Olympus). Reconstructed 3D images from 
z-stack tiles were imaged using an Olympus confocal laser-scanning microscope 
and rendered into the complete neuronal reconstructions by tiling the entire set of 
z-stacks using Imaris software. 
Stereological quantification. The number of TH-positive cells and «-SYN-pos- 
itive cells in the substantia nigra was determined by stereological measurements 
using the optical fractionator method in a computerized system as described 
before” (StereoInvestigator; MicroBright-Field, Magdeburg, Germany). Every 
fifth section throughout the entire substantia nigra was analysed, with a total of 
seven sections for each animal. The coefficients of error, calculated according to 
the procedure of Schmitz and Hof as estimates of precision’’ varied between 0.05 
and 0.10. The volume of «-SYN expression in the brain and the substantia nigra 
was determined by the Cavalieri method. Every fifth section covering the entire 
extent of the substantia nigra, with a total of seven sections for each animal, was 
included in the counting procedure. The coefficients of error varied between 0.05 
and 0.14. We quantified both the injected and non-injected substantia nigra 
(internal control), no cell loss was observed in the non-injected side. An invest- 
igator blind to different groups performed all the analyses. 
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Sarkosyl extraction. Unfixed brains were dissected to isolate substantia nigra and 
striatum from both hemispheres and frozen in liquid nitrogen. Brain samples were 
weighted and homogenized at 10% (w/v) in PBS buffer supplemented with phos- 
phatase inhibitor cocktail (sigma P5726 and P0044) and protease inhibitors 
(Roche Complete EDTA free). Sarkosyl was added to 1%. 250 il were centrifuged 
at 1,000g for 30 min to remove cell debris. The supernatants (200 jul) were further 
centrifuged at 200,000g for 60 min at 20 °C in an Optima MAX-XP Ultracentrifuge 
(Beckman). The pellets were washed with PBS-1% Sarkosyl, and resuspended in 
200 yl of PBS buffer. 
Proteinase K digestion, SDS-, native-PAGE and western blot analysis. Sarkosy] 
insoluble fractions containing «-SYN were incubated at 25 °C with proteinase K 
(0.001, 0.01, 0.1, 1 pg ml‘) (Roche) for 30 min. Aliquots were mixed v/v with 
denaturing buffer (50 mM Tris-HCl, pH 6.8, 4% SDS, 2% B-mercaptoethanol, 12% 
glycerol and 0.01% bromophenol blue) and heated at 90 °C to arrest immediately 
the cleavage reaction. After incubation for 10 min at 90°C, the samples were 
processed to monitor the time course of o-SYN cleavage by SDS-PAGE (15% 
polyacrylamide gels) followed by western blotting and detection with Clone42 
monoclonal antibody (BD Bioscience ref. 610787) or anti-phospho-Ser 129 
a-SYN antibody. For native-PAGE, sarkosyl insoluble fractions subjected or not 
to PK treatment were supplemented with Triton X-100 (0.2%, v/v) B-mercap- 
toethanol (2%), glycerol (12%) and bromophenol blue (0.01%) and loaded on 15% 
native-PAGE. The samples were transferred to nitrocellulose membranes and 
probed with D37A6 or Syn 211 anti- rodent and human «-SYN antibodies 
(Supplementary Table 3). 
Electrophysiology and MEA recordings. Primary cortical cultures were obtained 
from postnatal Wistar rats, employing standard methods™. Briefly, after mech- 
anical and enzymatic dissociation of the cerebral cortices of postnatal day 0 (PO) 
pups, cells were plated at a final density of ~1.500 mm * on glass coverslips and of 
6.500 mm” on commercial microelectrode arrays (MEAs; Multichannel Systems 
GmBH, Reutlingen, Germany). Individual MEAs incorporated 59 substrate 
microelectrodes made of indium titanium nitrate, featuring a diameter of 30 um 
and an electrical impedance of ~100 kQ at 1 kHz. The inter-electrode spacing was 
200 jum, and the microelectrodes arranged in a regular 8 X 8 layout that covered an 
area of ~2.56 mm*. MEAs and coverslips were coated overnight with polyethy- 
leneimine 0.1% (w/v) in millQ water at room temperature, before cell seeding. 
Seeded MEAs and coverslips were maintained under sterile conditions at 37 °C 
under 5% CO2 atmosphere with 100% relative humidity for several weeks. 
Patch-clamp intracellular recordings were performed 22-23 days after plating, 
from neurons cultured under control conditions (that is, 1 uM bovine serum 
albumin) or incubated for 3-4 days with 1 11M «-SYN (that is, as ribbons, fibrils, 
oligomers). Patch-clamp recordings were made from the cell somata under the 
whole-cell configuration using an Axon Multiclamp 700B Amplifier (Molecular 
Devices LLC, Sunnyvale, CA, US), employing both current-clamp and voltage- 
clamp modes. Patch electrodes were pulled from thick-walled borosilicate glass 
capillaries (1BF150, World Precision Instruments, Hitchin, UK) by a horizontal 
puller (P97, Sutter, Novato, US) to a resistance of 6-7 MQ. Electrodes were filled 
with an intracellular solution containing (in mM): 135 K-gluconate, 10 KCl, 10 
HEPES, 0.2 EGTA, 4 Mg-ATP, 0.4 Na3GTP, and 14 Na)-phosphocreatine (pH 7.3, 
titrated with KOH). All recordings were obtained at 34°C and replacing the 
culture medium by an artificial cerebrospinal fluid, constantly perfused at a rate 
of 1 ml min™! and containing (in mM): 145 NaCl, 4 KCl, 2 Na-pyruvate, 5 HEPES, 
5 glucose, 2 CaCl,, and 1 MgCl, (pH adjusted to 7.4 with NaOH). Voltage and 
current traces were sampled at 15 kHz, analog to digital converted at 16 bits, and 
stored on a PC employing the software LCG**. Data were analysed off-line, 


employing custom scripts written in MATLAB (The MathWorks, Natick, US) 
or Excel (Microsoft, USA). 

Extracellular recordings experiments were performed 15-20 days after plating, 

and continued until day 28, from neurons cultured over MEAs under control 
conditions (that is, 1 14M bovine serum albumin) or incubated for up to 10 days 
with 1 11M a-SYN (that is, as ribbons, fibrils, oligomers). Recordings were per- 
formed by a MEA1060BC amplifier, (Multichannel Systems GmBH, Reutlingen, 
Germany) inside an electronic-friendly incubator, maintaining 37 °C, 5% COz, 
and 100% relative humidity. This allowed us to monitor simultaneously, chron- 
ically, and non-invasively the neuronal electrical activity from up to 60 locations 
within the very same culture. MEA recordings were analysed off-line by 
QSpikeTools**. Briefly, the extracellular electric fields were monitored from the 
59 independent electrodes in each MEAs were sampled at 25 kHz per channel, 
1,200X amplified, bandpass-filtered (200-3,000 Hz), and digitally recorded for 30 
min every day, up to 6 days following the bath application of o-SYN, dissolved in 
culture medium with distinct concentrations (that is, 0.1, 0.5, and 1 4M). Simple 
spike-sorting, based on spike-shape peak polarity, was performed. Epochs of 
spontaneous synchronized firing across the MEAs electrodes were identified over 
25 ms bins by standard procedures”, and their features were extracted from each 
recording session. 
Statistical analysis. Results are expressed as means + standard error of the mean. 
Normality of data was assessed with the Lilliefors test. Statistical significance was 
assessed using one-way ANOVA followed by Dunnet’s or Tukey’s post-hoc test 
and column statistics followed by Bonferroni’s post-hoc test. Null hypothesis was 
rejected if the P value was below 0.5. Statistical significance level was set as follows: 
* if P< 0.05, ** if P< 0.01, *** if P< 0.001. In case of improper fitting of data bya 
normal distribution, median and interquartile range were used for data repres- 
entation, and Wilcoxon rank sum test was used for analysis. 

No statistical methods were used to predetermine sample size. Randomization 
was not applied because all animals (acquired via commercial animal lab, Janvier) 
used in this study were similar for age, sex and strain background. 


28. Vander Perren, A. et al. Efficient and stable transduction of dopaminergic neurons 
in rat substantia nigra by rAAV 2/1, 2/2, 2/5, 2/6.2, 2/7, 2/8 and 2/9. Gene Ther. 
18, 517-527 (2011). 

29. Kahle, P. J. et al. Selective insolubility of alpha-synuclein in human Lewy body 
diseases is recapitulated in a transgenic mouse model. Am. J. Pathol. 159, 
2215-2225 (2001). 

30. Luk, K.C. etal. Intracerebral inoculation of pathological alpha-synuclein initiates a 
rapidly progressive neurodegenerative alpha-synucleinopathy in mice. J. Exp. Med. 
209, 975-986 (2012). 

31. Oliveras-Salva, M. et al. rAAV2/7 vector-mediated overexpression of alpha- 
synuclein in mouse substantia nigra induces protein aggregation and progressive 
dose-dependent neurodegeneration. Mol. Neurodegener. 8, 44 (2013). 

32. Baekelandt, V. et al. Characterization of lentiviral vector-mediated gene transfer in 
adult mouse brain. Hum. Gene Ther. 13, 841-853 (2002). 

33. Schmitz, C. & Hof, P. R. Design-based stereology in neuroscience. Neuroscience 
130, 813-831 (2005). 

34. Reinartz, S., Biro, |., Gal, A., Giugliano, M. & Marom, S. Synaptic dynamics contribute 
to long-term single neuron response fluctuations. Front. Neural Circuits 8, 71 
(2014). 

35. Linaro, D., Couto, J. & Giugliano, M. Command-line cellular electrophysiology for 
conventional and real-time closed-loop experiments. J. Neurosci. Methods 230, 
5-19 (2014). 

36. Mahmud, M., Pulizzi, R., Vasilaki, E. & Giugliano, M. QSpike tools: a generic 
framework for parallel batch preprocessing of extracellular neuronal signals 
recorded by substrate microelectrode arrays. Front. Neuroinform. 8, 26 (2014). 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


Monomers Oligomers Fibrils __ Ribbons 


e f 
100 

3 0,04 ioe 14460,4 _ +1 atto-550 = MV atloeAr 

c eS & x 

=< Q ay 2 

cg 2 2 2 

3E0m : 2 2 

SE 2 2g g 

me} Cc c Cc 

3 £ = £ 

Ww 0 

0 0 0 
© © 
SESS SH HD GHD GDH H HH OG EGP HM GF Pg 
&. Oe “ SF Ve GP CP HF Ream oP HP YP Raed & 
1°) Mass (m/z) Mass (m/z) Mass (m/z) 

Extended Data Figure 1 | Characterization of a-SYN assemblies. 0.05 units per injection for all conditions. f-h, Mass spectrometry analysis of 
a-d, Electron micrographs of «-SYN monomers (a), oligomers (b), fibrils covalently labelled fluorescent wild-type (WT) «-SYN by MALDI-TOF. 
(c) and ribbons (d) used throughout this study. The scale bars represent f, Unlabelled WT «-SYN (theoretical mass 14,460.1 Da); g, atto-550 WT a-SYN 
200 nm. e, Quantification of endotoxin amounts in different recombinant (average molecular ratio 1 atto molecule per a-SYN molecule) and h, atto-647 


a-SYN preparations used throughout this study. Endotoxin levels were below WT a-SYN (average molecular ratio 1 atto molecule per %-SYN molecule). 
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Extended Data Figure 2 | Trans-synaptic spreading of fluorescently labelled 
a-SYN in rat striatum. «-SYN assemblies were injected in rat substantia nigra 
(red injection spot) and were allowed to spread for 7 days to the striatum. 
Sagittal and coronal views are depicted in a and b, respectively. Representative 
confocal images of a-SYN spreading in c, dopaminergic neurons and d, striatal 
dopaminergic axons for a-SYN oligomers, ribbons and fibrils after 7 days. 
Reconstructed images from z-stack tiles of the dopaminergic neurons and 
axons in striatum were rendered into complete axonal reconstructions by tiling 
the entire set of z-stacks. Scale bar is 20 jum. The rectangle identifying Atto-550- 
tagged o1-SYN is shown at a higher magnification in upper right corner of 
sections in lower panels. e, Representative confocal image of recombinant 
oligomeric Atto-550 labelled a-SYN uptake in medium spiny neurons (MSN) 
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after injection in substantia nigra. White arrows indicate positive inclusions. 
f, Quantification of fluorescently labelled «-SYN DARPP-32* MSN. Uptake is 
measured after 1 and 7 days for oligomers (1 day = 1,674, 17 days = 25118, 
Nanimals = 3), ribbons (11 day = 2,152, 7 days = 1,968, Nanimals = 3) and fibrils 
(1 day = 1,223, 17 days = 1,486, Manimals = 3). Oligomers show increased uptake 
rates compared to ribbons and fibrils (***P < 0.001, **P < 0.01, one-way 
ANOVA with Dunnet’s multiple comparison test for 7 day time point) and 
increases over time (***P < 0.001, one-way ANOVA with Tukey’s multiple 
comparison test). Error bars indicate s.e.m., scale bars indicate 20 um. 

g, Correlation of spreading volume and trans-synaptic spread at 7 days. 
Oligomers show the highest spreading capacity compared to fibrils and ribbons 
using two independent techniques and experiments. 
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Extended Data Figure 3 | Histopathological hallmarks after nigral 
inoculation of a-SYN strains. a—c, Immunofluorescent staining in substantia 
nigra shows a, granular cytoplasmic, b, intracytoplasmic and c, nuclear 
Pa-SYN inclusions in dopaminergic neurons. Reconstructed images from 


z-stack tiles were rendered into complete reconstructions by tiling the entire 
set of z-stacks. Scale bars indicate 20 Lm. 
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Extended Data Figure 4 | Phosphorylation pattern of different a-SYN phosphorylated «-SYN is observed upon injection of ribbons and fibrils, to 
assemblies upon rAAV-driven a-SYN overexpression. a, Inoculation of 22,254 cells + 2,800 (s.e.m, n = 4) and 19,690 cells + 1,803 (s.e.m, m = 4), 
ribbons and fibrils result in increased amount of phosphorylated o-SYN respectively (**P < 0.01, *P < 0.05, one-way ANOVA with Dunnet’s multiple 
cells in substantia nigra. Scale bar, 50 lum. b, Animals where «-SYN comparison test versus control, *P < 0.05, one-way ANOVA with Tukey’s 
overexpression is rAAV-driven yield a total of 12,426 + 1,288 cells with multiple comparison test). 


phosphorylated «-SYN (s.e.m, n = 4). A significant increase of cells with 
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Extended Data Figure 5 | Intracellular and extracellular electrophysio- 
logical recordings of early effects of a-SYN assemblies on the electrical 
properties of single neurons. Rat primary cortical neurons, cultured ex vivo 
for over 4 weeks, displayed no significant alterations of the intrinsic electrical 
phenotype after exposure for 3-4 days to 1 1M of o-SYN assemblies. This was 
characterized by means of intracellular recordings, and analysed in current- 
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clamp in terms of membrane passive properties (n = 32 cells), a, as membrane 
time constant, b, membrane apparent input resistance, c, resting membrane 
potential and e, as evoked excitable responses (n = 30 cells). By the same 
technique, the amplitude of d, excitatory spontaneous post-synaptic currents 
(PSCs) revealed a significant downregulation, recorded under voltage-clamp 
(n = 32 cells). 
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Extended Data Figure 6 | a-SYN strains propagate after intracerebral 
inoculation. a, Preformed o-SYN fibrils and ribbons resist 1% sarkosyl 
treatment and are pelleted upon sedimentation at 200,000g for 60 min at 25 °C 
in vitro. (T, total; S, Soluble; P, Pellet) b, Fibrils resist 1% sarkosyl treatment and 
persist more after striatal injections than ribbons or oligomers. Sarkosyl- 
insoluble «-SYN from animals inoculated fibrils or ribbons is phosphorylated. 
Mean and associated standard error was calculated from 3 independent 
quantifications of the intensities made using Image] on samples from two 
independent experiments. 
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Extended Data Figure 7 | a-SYN strains recruit rodent a-SYN. Seeding of 
endogenous a-SYN in rat brain by exogenous human ribbons, fibrils or 
diseased rat brain homogenates. PBS was inoculated to the control animal. The 
nature of o#-SYN assemblies was assessed by western blot following native 
PAGE separation prior (—) or after (+) proteinase K (76 ng ul?) treatment. 
a, Staining with rat-specific antibody (D37A6) detects endogenous o-SYN and 
b, staining with human-specific antibody (Syn211) detects exogenous 
injected o-SYN on a duplicate blot. Arrows indicate fibrillar 7-SYN at the 
bottom of the gel well. The non specific band (*) is recognized by the secondary 
antibody. Representative image from 3 independent experiments is shown. 
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Extended Data Figure 8 | a-SYN fibrils persist months after intracerebral 
inoculation. a, «-SYN assemblies were injected in rat substantia nigra (red 
injection spot) and assessed 4 months later in striatum. b, Immunohisto- 
chemical analysis of rat striatum reveals clear human «-SYN™ inclusions 
colocalizing with dopaminergic axons. Representative image of fibrils is shown 
with detailed image in right upper corner. Right panel shows pseudocolour 
image of only human @-SYN. Scale bar indicates 30 jum. c, Quantification of 
different «-SYN strains in rat striatum by means of total fluorescent units. 


Fibrils appear to persist to a much higher extent compared to ribbons 

(***P < 0.001, one-way ANOVA with Tukey’s multiple comparison test, 

n = 4). Oligomers and brain homogenate show very low immunofluorescent 
staining for human-specific «-SYN. d, Fibrils appear as axonal inclusions 
and are abundantly present in striatum in contrast to ribbons after nigral 
injection. Oligomers and brain homogenate are not detected using the 

same human-specific antibody Syn211. Scale bar indicates 40 jim. 
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Extended Data Figure 9 | a-SYN ribbons and fibrils spread across the system after intravenous injection of a-SYN oligomers, ribbons and fibrils. 
central nervous system after intravenous administration. Overview of Scale bars indicate 50 fm. 


human «-SYN (Syn211) inclusions in different areas of the central nervous 
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Extended Data Figure 10 | Intravenous injection of a-SYN strains leads to 
increased microglial response and distinct histopathological hallmarks in 
the spinal cord. Quantification of human «-SYN (hua-SYN) density and 
Macl/CD11b microglial response following intravenous administration of a 
saline solution or «-SYN oligomers, ribbons or fibrils in a, b, cervical, 

c, d, thoracic and e, f, lumbosacral segments (**P < 0.05, one-way ANOVA 
with Dunnet’s multiple comparison test versus control, ***D < 0.001, “P< 
0.01, *P < 0.05, one-way ANOVA with Tukey’s multiple comparison test, n = 
4). Error bars indicate s.e.m., white boxes and grey boxes indicate 60 and 120 
days after intravenous injections, respectively. g, Immunohistochemical 
staining for growth associated protein 43 (GAP43), an early injury marker, 
shows a filamentous distribution for 2-SYN ribbons and cellular distribution 
for a-SYN fibrils (white arrow heads). Scale bars indicate 50 tm. 
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Metastatic disease remains the primary cause of death for patients 
with breast cancer. The different steps of the metastatic cascade rely 
on reciprocal interactions between cancer cells and their micro- 
environment. Within this local microenvironment and in distant 
organs, immune cells and their mediators are known to facilitate 
metastasis formation’”. However, the precise contribution of tumour- 
induced systemic inflammation to metastasis and the mechanisms 
regulating systemic inflammation are poorly understood. Here we 
show that tumours maximize their chance of metastasizing by evok- 
ing a systemic inflammatory cascade in mouse models of spontan- 
eous breast cancer metastasis. We mechanistically demonstrate that 
interleukin (IL)-1f elicits IL-17 expression from gamma delta (5) 
T cells, resulting in systemic, granulocyte colony-stimulating factor 
(G-CSF)-dependent expansion and polarization of neutrophils in mice 
bearing mammary tumours. Tumour-induced neutrophils acquire 
the ability to suppress cytotoxic T lymphocytes carrying the CD8 
antigen, which limit the establishment of metastases. Neutralization 
of IL-17 or G-CSF and absence of y6 T cells prevents neutrophil 
accumulation and downregulates the T-cell-suppressive phenotype 
of neutrophils. Moreover, the absence of yé T cells or neutrophils 
profoundly reduces pulmonary and lymph node metastases with- 
out influencing primary tumour progression. Our data indicate that 
targeting this novel cancer-cell-initiated domino effect within the 
immune system—the 6 T cell/IL-17/neutrophil axis—represents a 
new strategy to inhibit metastatic disease. 

In patients with breast cancer, increased neutrophil abundance 
predicts worsened metastasis-specific survival**. Currently, the role 
of neutrophils in metastasis is controversial, since both pro- and anti- 
metastatic functions have been described*’. We found a profound sys- 
temic expansion of neutrophils in mammary tumour-bearing K14cre; 
Cdhi*"; Trp53” ¥ (KEP) mice®, compared with wild-type (WT) litter- 
mates (Extended Data Fig. la, b). Neutrophils, defined as CD45* 
CD11b" Ly6G* Ly6C*F4/80° cells, accumulated throughout every organ 
examined (Extended Data Fig. 1c, d). We also investigated our recently 
described KEP-based model of spontaneous breast cancer metastasis” 
(Fig. 1a), where systemic neutrophil expansion was observed as well 
(Fig. 1b and Extended Data Fig. le, f). Neutrophil expansion was 
tumour-induced, because surgical removal of the primary tumour re- 
sulted in their immediate reduction (Extended Data Fig. 1g). 

To determine the functional significance of neutrophils in metastasis, 
neutrophils were depleted using anti-Ly6G antibodies (Extended Data 
Fig. 2a—c). Treatment was initiated when tumours were palpable and 
continued until mice developed overt metastatic disease. Neutrophil 
depletion did not influence tumour growth (Extended Data Fig. 2d), 
tumour histopathology (Extended Data Fig. 2e) or microvessel density 
(Extended Data Fig. 2e, f). In contrast, neutrophil depletion resulted in 
significant reduction in both pulmonary and lymph node metastasis 


(Fig. 1c). These data indicate that neutrophils assist the spread of 
cancer cells to multiple locations. 

Next, we evaluated the role of neutrophils in different phases of the 
metastatic cascade. Neutrophils were depleted during primary tumour 
growth (early phase) or after removal of the primary tumour (late phase) 
(Fig. la). Interestingly, neutrophil depletion decreased multi-organ 
metastasis in the early phase, but not the late phase (Fig. 1d). Metas- 
tatic nodule size was not affected (Extended Data Fig. 2g), suggesting 
that neutrophils facilitate multi-organ metastasis during the early steps 
of the metastatic cascade. 

To understand the mechanism by which neutrophils facilitate meta- 
stasis, their phenotype was investigated. Previous reports identified the 
haematopoietic stem cell marker cKIT on pro-metastatic myeloid cells*'°, 
and CD11b* VEGERI* cells have been implicated in the pre-metastatic 
niche’®’?"*, A greater proportion of neutrophils from tumour-bearing 
KEP mice expressed cKIT, while both neutrophils and monocytes from 
WT and KEP mice expressed VEGFR1 (Extended Data Fig. 3a, b). In 
the metastasis model, cCKIT* neutrophils also expanded systemically, 
as tumours grew larger (Extended Data Fig. 3c) and reduced to baseline 
levels after tumour resection (Extended Data Fig. 3d). Nuclear morphol- 
ogical analysis revealed characteristics of immature cells’*, including 
banded, circular and non-segmented nuclei, whereas most WT neu- 
trophils appeared hyper-segmented (Extended Data Fig. 3e), suggest- 
ing that KEP mammary tumours promote the release of immature 
neutrophils into circulation. 

Next-generation RNA sequencing (RNA-seq) was performed on cir- 
culating neutrophils from WT and tumour-bearing KEP mice, revealing 
100 differentially expressed genes (Extended Data Fig. 4a and Extended 
Data Table 1). Several genes upregulated in neutrophils from KEP mice, 
including Prok2 (also known as Bv8), S100a8 and S100a9 (Fig. 2a), 
have previously been linked to metastasis®'*. Nos2, the gene encoding 
inducible nitric oxide synthase (iNOS), was the most strongly upregu- 
lated gene, by more than 150-fold (Fig. 2a). Because iNOS suppresses 
T cells’, we hypothesized that neutrophils promote metastasis via 
immunosuppression. Indeed, neutrophils from KEP mice inhibited 
the CD3/CD28-induced proliferation of naive splenic CD8* T cells ex 
vivo compared with WT neutrophils, and an iNOS inhibitor reversed 
this effect (Fig. 2b and Extended Data Fig. 4b). In lungs of control 
and neutrophil-depleted tumour-bearing mice, the proportions of 
CD8* T cells did not differ (Extended Data Fig. 4c). However, the 
effector phenotype of CD8* T cells was markedly enhanced upon neu- 
trophil depletion, as evidenced by a significantly greater proportion of 
CD62L” CD44* and interferon-y* (IFN-7~) cells (Fig. 2c and Extended 
Data Fig. 4d, e). To establish further a mechanistic link between neu- 
trophils and CD8™ T cell activity, we depleted both cell populations in 
the metastasis model. Combined depletion of neutrophils and CD8* 
T cells reversed the metastasis phenotype of neutrophil depletion alone 
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Figure 1 | Neutrophils promote breast cancer metastasis. a, Spontaneous 
metastasis model. Tumour fragments from KEP mice are orthotopically 
transplanted into WT female FVB/N recipient mice (designated by 1), allowed 
to proliferate (2), then surgically resected (3). Metastases develop in 100% 

of recipient mice. Antibody-mediated depletion experiments were performed 
in three ways: from palpable tumours to metastasis-related death (continuous 
treatment), during primary tumour growth (early phase) or after surgery 
until metastasis-related death (late phase). b, Neutrophil proportions in lungs 
at the indicated tumour size (n = 6, 5, 6 and 8 mice for 0, 9, 25 and 100 mm’, 
respectively; Kruskal-Wallis test followed by post-hoc Dunn’s test). c, d, Images 
of cytokeratin 8-stained lung sections, quantification of lung metastases 

and incidence of metastasis in lymph nodes. Neutrophils were depleted 
continuously until metastasis-related death in c (n = 11 mice per group; 
Mann-Whitney U-test and Fisher’s exact test) or depleted during the early or 
late phases in d (n = 9 control, 11 early phase, 14 late phase; Kruskal-Wallis test 
followed by post-hoc Dunn’s test and Fisher’s exact test). Data in d are 
representative of two independent experiments. All data are mean + s.e.m. 
*P < 0.05, **P < 0.01, ***P < 0.001. Scale bars, 6mm. 


(Fig. 2d), without affecting primary tumour growth (Extended Data 
Fig. 4f). Depletion of CD8" T cells alone did not alter tumour growth 
or multi-organ metastasis (data not shown). These data suggest that neu- 
trophils facilitate cancer cell spread by suppressing CD8" T cells. As 
such, neutrophils in the KEP model can be categorized as a subpopu- 
lation of myeloid-derived suppressor cells». 

We then asked how mammary tumours induce systemic neutrophil 
expansion. Cytokine profile comparison of WT mammary glands and 
KEP mammary tumours showed that granulocyte-macrophage colony- 
stimulating factor (GM-CSF) and G-CSF levels—two key regulators of 
neutrophil biology’’—were not significantly increased in KEP tumours 
(Extended Data Fig. 5a, b). However, expression of IL-1, IL-6 and IL- 
12p40, a subunit of IL-23, was increased (Extended Data Fig. 5a, b). 
These cytokines are known to stimulate IL-17 from lymphocytes”. 
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Figure 2 | Neutrophils suppress CD8* T cell activation to facilitate 
metastasis. a, Gene expression in circulating neutrophils (n = 5 WT, 10 KEP 
mice). b, Circulating neutrophils from either WT (n = 7) or tumour-bearing 
KEP mice (n = 8) were incubated with CFSE-labelled splenic CD8* T cells 
from WT mice and CD3/CD28 stimulation beads. The iNOS inhibitor 
L-NMMaA was added where indicated (n = 8). After 48 h, CD8* T cell 
proliferation was measured. c, CD8* T cell activation status in lungs of 
transplanted tumour-bearing control and neutrophil-depleted mice (n = 6 per 
group). d, Quantification of lung metastases and incidence of lymph node 
metastasis following neutrophil and CD8* T cell depletion (n = 11 control, 
16 anti-Ly6G, 8 anti-Ly6G/CD8; Kruskal-Wallis test followed by post-hoc 
Dunn’s test and Fisher’s exact test). All data are mean + s.e.m. *P < 0.05, 
**P < 0.01, ***P < 0.001. 


In inflammatory diseases such as psoriasis, lymphocyte-derived IL-17 
regulates neutrophil expansion via systemic induction of G-CSF*?”*, 
We hypothesized that the same inflammatory cascade is important in 
breast cancer metastasis. Indeed, serum levels of IL-17A and G-CSF were 
higher in tumour-bearing KEP mice than in WT mice (Fig. 3a). Neutral- 
ization of IL-17A in tumour-bearing KEP mice decreased G-CSF serum 
levels, while G-CSF blockade did not affect IL-17A levels (Fig. 3a), 
indicating that IL-17 is upstream of G-CSF. Inhibition of either cytokine 
reduced circulating neutrophils, lowered cCKIT* neutrophil proportions 
(Fig. 3b) and reversed neutrophil phenotype (Fig. 3c). Injection of 
recombinant G-CSF to anti-IL-17-treated tumour-bearing KEP mice 
overcame the effects of anti-IL-17 treatment (Extended Data Fig. 5c-e). 
Additionally, treatment of WT mice with recombinant G-CSF resulted 
in neutrophil expansion, increased presence of cKIT™ neutrophil pro- 
portions and changed neutrophil phenotype (Extended Data Fig. 5c-e). 
These data demonstrate a requirement for the IL-17/G-CSF signalling 
cascade in both neutrophil expansion and phenotype. 

Next, we determined the source of IL-17. As T cells are known to 
produce IL-17 (refs 20, 21, 23), splenic CD3™ T cells were analysed using 
a T cell-specific gene expression array. This analysis validated upregu- 
lation of IL-17-related cytokines in T cells from tumour-bearing KEP 
mice (Extended Data Fig. 5f). We then asked whether lymphocytes are 
the only source of IL-17 and whether they drive metastasis. KEP mice 
were crossed with Rag] ‘~ mice, which lack T and B cells. Tumour ini- 
tiation, proliferation and histology were the same between KEP;Rag1 a 
and KEP;Ragl ~'~ mice (Extended Data Fig. 6a and data not shown). 
However, KEP;Rag1 ~’~ mice exhibited lower levels of IL-17A and G-CSF 
in serum (Fig. 3d), decreased neutrophil counts (Fig. 3e) and altered 
neutrophil phenotype (Fig. 3f). Transforming growth factor B1 (TGF- 
61) levels were unchanged between KEP;Rag]‘/~ and KEP;Ragl ‘~ 
mice (Extended Data Fig. 6b), suggesting that, unlike other models”, 
TGF-B plays a lesser role in modulating neutrophil phenotype than 
IL-17-induced G-CSF. Importantly, KEP;Rag1~'~ mice displayed less 
pulmonary and lymph node metastases (Fig. 3g). The metastasis phe- 
notype in KEP;Ragl ~’~ mice was validated in the metastasis model 
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Figure 3 | Lymphocyte-derived IL-17 is required for G-CSF-induced 
neutrophil expansion and phenotype. a, Cytokine levels in serum of WT 
(n = 5), tumour-bearing KEP mice (n = 9) and anti-IL-17- (n = 7) or anti-G- 
CSF-treated KEP mice (n = 6). b, Proportions of circulating neutrophils and 
cKIT-expressing neutrophils in KEP mice during primary tumour growth 

(n = 9 control, 8 anti-IL-17, 6 anti-G-CSF). c, Gene expression in circulating 
neutrophils from tumour-bearing KEP control mice (n = 9), anti-IL-17- 

(n = 6) or anti-G-CSF-treated KEP mice (n = 6). d, Cytokine levels in serum 
of tumour-bearing KEP;Rag]*/~ (n = 9) and KEP;Ragl~‘~ mice (n = 7). 

e, Absolute blood neutrophil counts in tumour-bearing KEP;Rag1 +/~ mice 
(n = 8) or KEP;Ragl /~ mice (n = 5). f, Gene expression in circulating 
neutrophils from KEP;Ragl */~ (n = 10) and KEP;Ragl ~~ mice (n = 8). 

g, Percentage of tumour-bearing mice with lung or lymph node metastasis 
(n=50 KEP;Ragl*"~, 32 KEP;Rag1/~ mice). All data are mean + s.e.m. 

*P < 0.05, **P < 0.01, ***P < 0.001 as determined by Mann-Whitney U-test 
or Fisher’s exact test. 


where Rag] ‘~ mice were recipients of transplanted KEP tumour frag- 
ments, resulting in reduced pulmonary metastasis (Extended Data Fig. 6c). 
Thus, IL-17-producing lymphocytes drive neutrophil accumulation, 
phenotype and metastasis. 

Direct ex vivo intracellular cytokine staining was performed to deter- 
mine which T lymphocyte subset produces IL-17. Both CD4* T cells 
and v6 T cells expressed IL-17A (Fig. 4a), and both IL-17-producing 
subpopulations were increased in various organs of tumour-bearing 
KEP mice compared with WT mice (Fig. 4b and Extended Data Fig. 7a). 
In primary tumours, the abundance of y5 and CD4* T cells was too 
low (<0.2% and <2% of all live cells, respectively) to assess IL-17 ex- 
pression reliably. yé T cells exhibited higher IL-17A levels than CD4* 
T cells (Fig. 4a and Extended Data Fig. 7b). Both cell populations were 
depleted to determine their functional importance. CD4* T cell deple- 
tion lowered cKIT* neutrophils, but failed to influence total neutrophil 
expansion, IL-17A or G-CSF levels (Extended Data Fig. 7c-e). Con- 
versely, depletion of 6 T cells decreased IL-17A and G-CSF serum levels 
(Fig. 4c), reduced circulating neutrophils, lowered cKIT* neutrophil 
proportions (Fig. 4d and Extended Data Fig. 7c) and reversed neutro- 
phil phenotype (Fig. 4e). These data indicate that IL-17-producing y6 
T cells promote neutrophil expansion and phenotypic alterations. IL- 
17-producing 76 T cells in tumour-bearing KEP mice were CD27, 
mostly Vy4", and a proportion expressed CCR6, IL-1R1 and ROR-yt 
(Extended Data Fig. 8a, b) similar to other inflammatory diseases”’. 
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Figure 4 | IL-1B-activated, IL-17-producing y6 T cells regulate neutrophil 
expansion, neutrophil phenotype and metastasis. a, Intracellular staining 
within circulating T cells of tumour-bearing KEP mice. b, Proportion of IL- 
17A-producing 76 T cells (WT, n = 5; KEP, n = 6). c, Cytokine levels in serum 
of control (m = 10) and anti-y5TCR-treated (n = 7) KEP mice. d, Proportions 
of circulating neutrophils and cKIT-expressing neutrophils in KEP mice 
during primary tumour growth (n = 8 per group). e, Gene expression in 
circulating neutrophils from tumour-bearing KEP control mice (n = 10) and 
anti-ySTCR-treated KEP mice (n = 6). f, Proportion of IL-17A-producing 

75 T cells in tumour-bearing mice (n = 6 KEP control, 5 anti-IL-23p19, 5 anti- 
IL-1). g, Cytokine levels in serum (n = 9 KEP control, 5 anti-IL-23p19, 

6 anti-IL-1). h, Proportions of circulating neutrophils and cKIT-expressing 
neutrophils in KEP mice during primary tumour growth (n = 9 control, 5 anti- 
IL-23p19, 5 anti-IL-18). i, j, Quantification of lung metastases and incidence 
of lymph node metastasis in the metastasis model (n = 10 control, 9 anti- 
ySTCR-treated mice; n = 9 per group Terd*’~ and Tcrd ‘~ mice). All data 
are mean + s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001 as determined by 
Mann-Whitney U-test or Fisher’s exact test. 


We then asked how KEP mammary tumours activate IL-17-producing 
6 T cells. On the basis of literature*° ~ and cytokine analysis (Extended 
Data Fig. 5a, b), we focused on IL-23 and IL-1. IL-17A expression by 
5 T cells, G-CSF serum levels and neutrophil expansion was decreased 
by neutralization of IL-1, but unaffected by inhibition of IL-23 (Fig. 4f-h). 
Macrophages were the most abundant IL-1f-expressing cell type in 
KEP tumours (Extended Data Fig. 8c, d). These data provide a mech- 
anistic link between mammary tumours and y6 T cells. 

Depletion of 76 T cells in the early phase of the metastasis model did 
not affect tumour histopathology, microvessel density or primary tumour 
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growth (Extended Data Fig. 8e and data not shown). Importantly, how- 
ever, pulmonary and lymph node metastases were significantly decreased 
in y0 T cell-depleted mice (Fig. 4i). These data were validated with 
Tcrd’‘— mice, which lack v6 T cells. KEP tumour fragments were 
orthotopically transplanted into Tcrd‘’~ and Terd ’~ mice and re- 
sected after outgrowth. Genetic elimination of y6 T cells also resulted 
in a significant reduction in pulmonary metastasis (Fig. 4j) without 
affecting primary tumour growth (Extended Data Fig. 8f). These data 
confirm a pro-metastatic role for y6 T cells. 

In summary, we show that mammary tumour-induced, IL-17- 
producing 5 T cells drive systemic expansion and polarization of neu- 
trophils towards a CD8" T cell-suppressive phenotype and subsequent 
metastasis formation in distant organs (Extended Data Fig. 9). The im- 
portance of neutrophils during the early steps of the metastatic cascade 
and the upregulation of Prok2, $100a8 and $100a9 in neutrophils 
suggest that neutrophils may help to establish the pre-metastatic 
niche*’®"*; however, the role of these neutrophil-derived factors and 
others remains to be established in the KEP model. In patients with 
breast cancer, independent clinical studies consistently point towards 
a pro-metastatic role for neutrophils, y6 T cells and IL-17 (refs 3, 4, 
26-29). Here, we establish a mechanistic connection between these in- 
dependent clinical observations. In infection and inflammatory disor- 
ders, the y6 T cell/IL-17/neutrophil axis drives disease pathogenesis™’”**”. 
We now demonstrate that this targetable pathway also perpetuates breast 
cancer metastasis. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Mice. The generation and characterization of K14cre;Cdhi™. sTrp53" /F (KEP) 
mice—a conditional model of invasive lobular breast cancer—has been described*. 
KEP mice were backcrossed onto the FVB/N background. KEP mice were crossed 
with Rag! ’~ mice (FVB/N; a gift from L. Coussens)"! to generate KEP;Rag] */~ 
and KEP;Ragl ~'~ mice?*. The onset of mammary tumour formation was mon- 
itored twice weekly by palpation and calliper measurements starting at 4 months of 
age. Terd ’~ mice on the FVB/N background were a gift from A. Hayday*. The 
spontaneous metastasis model has also been described”. Briefly, this model is based 
on the orthotopic transplantation of KEP tumour pieces into 10- to 12-week-old 
female recipient FVB/N mice, Rag ~~ mice, Tcrd*’~ or Tcrd‘~ mice. These tu- 
mour pieces are allowed to grow out, then surgically removed at 100 mm’, after 
which 100% of mice develop overt metastatic disease. 

To deplete immune cells or neutralize cytokines, mice were injected intraperito- 
neally with an initial 400 pg followed by 100 ig thrice weekly for anti-Ly6G (clone 
1A8; BioXCell), 200 jig twice weekly for anti-CD8 (clone 2.43; BioXCell) or 100 pg 
twice weekly for anti-ySTCR (clone GL3; purified by the NKI protein facility). For 
cytokine neutralization experiments, KEP mice were injected intraperitoneally with 
50 Lg twice weekly for anti-IL-17A (clone 17F3; BioXCell), 50 ug thrice weekly for 
anti-G-CSF (clone 67604; R&D Systems), 50 jig twice weekly anti-IL-23p19 (clone 
G23-8; eBioscience) or 50 1g twice weekly anti-IL-1f (clone B122; BioXCell). Con- 
trol mice received equal amounts of isotope control antibodies or equal volumes of 
PBS. Where indicated, WT and KEP mice were injected intraperitoneally with 5 pg 
rG-CSF (Peprotech) for four consecutive days and were killed on the fifth day. 
Tumour-bearing KEP mice injected with rG-CSF received anti-IL-17 at the same 
schedule as above. Antibody injections began when KEP mammary tumours reached 
25 mm? until death at 225 mm’, or transplanted tumours reached 9 mm” where 
indicated until surgery at 100 mm. Three independent KEP donor tumours were 
tested in neutrophil depletion experiments resulting in the same outcome. One of 
these donor tumours was used throughout the remainder of the study. Blood sam- 
ples were taken before and during antibody injections for flow cytometry analyses. 
Animals were randomized before beginning the treatment schedule. Mice were 
kept in individually ventilated and open cages, and food and water were provided 
ad libitum. Animal experiments were approved by the Animal Ethics Committee 
of the Netherlands Cancer Institute and performed in accordance with institutional, 
national and European guidelines for Animal Care and Use. 
Immunohistochemistry. Formalin-fixed tissues were processed by routine pro- 
cedures. Haematoxylin and eosin staining was performed as described’. Citrate 
antigen retrieval was used for all staining procedures. Neutrophils were detected 
using either anti-Ly6B (clone 7/4; Cedarlane) or anti-Ly6G (clone 1A8; BD Bio- 
sciences), when primary tumours reached 100 mm’. Quantitative analysis of neu- 
trophil abundance was performed by counting cells in five high-power (40) fields 
of view (FOV) per tissue. Metastases were detected using anti-cytokeratin 8 (clone 
Troma 1; Developmental Studies Hybridoma Bank, University of Iowa). In the me- 
tastasis model, the total number of cytokeratin 8* nodules was scored in one lung 
section of each animal. The size of each nodule was measured using ImageJ, then 
represented as arbitrary units. Lymph node metastases were scored as positive or 
negative on the basis of the presence of cytokeratin 8* metastases. Mice that de- 
veloped overt metastatic disease (that is, respiratory distress or 225 mm” axillary 
lymph node metastasis) were included in the analysis; mice that were killed as a 
result of local recurrence were excluded. Vimentin, E-cadherin and CD34 staining 
was performed as previously described’ and scored independently by two blinded 
researchers. Microvessel density was scored by averaging the total number of blood 
vessels from five fields of view for each tumour section. For metastasis quantifica- 
tionin K14cre;Cdhi"™"; Trp53"” *:Ragl */ and K14cre;Cdhi™; Trp53" M *:Ragl ~~ mice, 
single lung or lymph node sections were scored as positive or negative on the basis 
of the presence of cytokeratin 8* metastases. Stained slides were digitally processed 
using the Aperio ScanScope and captured using ImageScope software version 11.0.2. 
Brightness and contrast for representative images were adjusted equally among 
groups. 

Flow cytometry and intracellular staining. Tissues were collected in ice-cold PBS. 
Blood samples were collected in tubes containing heparin. Tumours and lungs were 
mechanically chopped using a McIlwain tissue chopper (Mickle Laboratory Engi- 
neering). Tumours were digested for 1 h at 37 °C in3 mg ml * collagenase type A 
(Roche) and 25 wg ml ~ 1 DNase (Sigma) in serum-free DMEM medium. Lungs were 
digested for 30 min at 37 °C in 100 jg ml * Liberase TM (Roche). Enzyme activity 
was neutralized by addition of cold DMEM/8% FCS and suspension was dispersed 
through a 70 1m cell strainer. Spleen, lymph nodes and liver were mashed through 
a 70 um cell strainer. All single-cell suspensions were treated with NH,Cl eryth- 
rocyte lysis buffer. Cells were stained with directly conjugated antibodies (listed 
below) for 30 min at 4 °C in the dark in PBS/1% BSA. 7AAD (1:20; eBioscience) or 
Fixable Viability Dye eFluor 780 (1:1,000; eBioscience) was added to exclude dead 
cells. For intracellular staining, single-cell suspensions were stimulated in IMDM 
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containing 8% FCS, 100IUml’ penicillin, 100 pgml~’ streptomycin, 0.5% 
B-mercaptoethanol, 50 ng ml | PMA, 1 uM ionomycin and (1:1,000) Golgi-Plug 
(BD Biosciences) for 3h at 37 °C. Surface antigens were stained first, followed by 
fixation and permeabilization using the Cytofix/Cytoperm kit (BD Biosciences), 
then staining of intracellular proteins. All experiments were performed using a BD 
LSRII flow cytometer using Diva software. Data analyses used FlowJo Software ver- 
sion 9.7.1. Median fluorescence intensity of IL-17A expression was calculated after 
gating on IL-17* cells within individual T cell subsets. 

All antibodies were purchased from eBioscience, except Ly6G-AlexaFluor 700, 
CCR6 and Vy1 from BioLegend, and VEGFR1, CCR2, IL-23R and IL-1R1 from 
R&D Systems. The following antibodies were used in these experiments. 

Myeloid panel: CD45-eFluor 605NC (1:50; clone 30-F11), CD11b-eFluor 650NC 
(1:400; clone M1/70), Ly6G-AlexaFluor 700 (1:400; clone 1A8), Ly6C-eFluor 450 
(1:400; clone HK1.4), F4/80-APC-eFluor 780 (1:200; clone BM8), VEGFR1-APC 
(1:50; clone 141522), cKIT (1:400; clone 2B8), CCR2-PE (1:50; clone 475301), 
CXCR4-PerCP-eFluor 710 (1:400; clone 2B11), CD49d-FITC (1:400; clone R1-2) 
or Gr1-FITC (1:400; clone RB6-8C5), 7AAD. 

Lymphoid panel I: CD45-eFluor 605NC (1:50; clone 30-F11), CD11b-eFluor 
650NC (1:400; clone M1/70), CD3-PE-Cy7 (1:200; clone 145-2C11), CD4-APC- 
eFluor 780 (1:200; clone GK1.5), CD8-PerCP-eFluor 710 (1:400; clone 53-6.7), 
y5TCR-FITC (1:400; clone GL3), CD49b-APC (1:400; clone DX5), IL-17A-PE 
(1:200; clone eBiol17B7), IFNy-eFluor 450 (1:200; clone XMG1.2), 7AAD. 

Lymphoid panel II: CD45-eFluor 605NC (1:50; clone 30-F11), CD11b-APC- 
eFluor 780 (1:200; clone M1/70), CD3-PE-Cy7 (1:200; clone 145-2C11), CD4-APC- 
eFluor 780 (1:200; clone GK1.5), CD8-PerCP-eFluor 710 (1:400; clone 53-6.7), 
ySTCR-PE (1:400; clone GL3), CD49b-APC (1:400; clone DX5), CD62L-AlexaFluor 
700 (1:400; clone MEL-14), CD44-FITC (1:400; clone IM7), IFNy-eFluor 450 
(1:200; clone XMG1.2), CD19-APC-eFluor 780 (1:200; clone eBiol D3), Fixable 
Viability Dye eFluor 780. 

6 T cell phenotyping panel I: CD45-eFluor 605NC (1:50; clone 30-F11), CD11b- 
APC-eFluor 780 (1:200; clone M1/70), CD3-PerCP-eFluor 710 (1:200; clone 145- 
2C11), CD4-APC-eFluor 780 (1:200; clone GK1.5), yYSTCR-PE (1:400; clone GL3), 
CD19-APC-eFluor 780 (1:200; clone eBiolD3), CD27-PE-Cy7 (1:200; clone 
LG.7F9), IL-1R1-FITC (1:25; clone 129304), CCR6-Brillant Violet 421 (1:200: 
clone 29-2L17), IL-23R-AlexaFluor 700 (1:25; clone 753317), RORyt-APC (1:100; 
clone B2D), Fixable Viability Dye eFluor 780. 

65 T cell phenotyping panel IT: CD45-eFluor 605NC (1:50; clone 30-F11), CD11b- 
APC-eFluor 780 (1:200; clone M1/70), CD4-APC-eFluor 780 (1:200; clone GK1.5), 
y5TCR-PE (1:400; clone GL3), CD19-APC-eFluor 780 (1:200; clone eBiolD3), 
CD27-PE-Cy7 (1:200; clone LG.7F9), Vy1-FITC (1:100; clone 2.11), Vy4-PerCP- 
eFluor 710 (1:100; clone UC3-10A6), IFNy-eFluor 450 (1:200; clone XMG1.2), IL- 
17A-APC (1:200; clone eBiol17B7), Fixable Viability Dye eFluor 780. 

White blood cell counts. Total white blood cell numbers were measured on a 
haematology analyser (Becton Dickinson). Neutrophil numbers were then calcu- 
lated on the basis of the percentage of CD11b*Ly6G* Ly6C” cells. 

Giemsa staining. Blood was collected by tail vein puncture in heparin-coated 
tubes. Red blood cells were lysed with NH, Cl lysis buffer. White blood cells were 
smeared onto glass slides then stained with decreasing concentrations of Wright- 
Giemsa solution. 

RNA-seq. Ly6G* neutrophils were isolated by magnetic column (Miltenyi) from 
blood of mice. KEP mice with mammary tumours around 225 mm‘ in size and 
age-matched WT mice were used. Purity of isolated neutrophils was validated by 
flow cytometry and only samples greater than 90% purity were used. RNA was iso- 
lated using Trizol and then treated with DNase I (Invitrogen). Samples were put 
over a Qiagen RNeasy column for cleanup. RNA quality was confirmed with a 2100 
Bioanalyzer from Agilent. RNA-Seq libraries were prepared using the reagents 
provided in the [lumina TruSeq RNA Sample Preparation Kit, following the man- 
ufacturer’s protocol. Libraries were PCR amplified for 12 cycles and sequenced on 
an Illumina HiSeq 2000 System with TruSeq reagent kits and software, generating 
paired-end 51 base-pair reads. Sequence reads were aligned to the mouse reference 
genome (National Center for Biotechnology Information build 37) using TopHat. 
HTSeq-count was then used to generate a list of the total number of uniquely map- 
ped reads for each gene and sample. Sequence reads were normalized to 10 million 
reads per sample and log, transformed with the formula, log,(((expression gene 
X = library size)10°)+ 1), where the library size was the sum of all expression values 
per sample. To determine which genes were differentially expressed between sam- 
ples, the R package Limma was used. Absolute gene expressions were used as input 
and genes with no expression in any sample were removed from the data set. Voom 
was used to transform the count data to log, counts per million and estimation of the 
variance. The P value was set to a cut-off of 0.05, resulting in 100 significant, differ- 
entially expressed genes. Unsupervised clustering was performed on these 100 genes 
and the data were transformed into a heat-map. 
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Real-time PCR. Neutrophil RNA was extracted as above then converted to com- 
plementary DNA (cDNA) with an AMV reverse transcriptase using Oligo(dT) 
primers (Invitrogen). cDNA (20 ng per well) was analysed by SYBR green real- 
time PCR with 500 nM primers using a LightCycler 480 thermocycler (Roche). 
B-actin was used as a reference gene. The following primer sequences were used for 
each gene: Nos2 forward 5'-GTTCTCAGCCCAACAATACAAGA-3’, reverse 5’- 
GTGGACGGGTCGATGTCAC-3’; Prok2 forward 5'-CTTCGCCCTTCTTCTT 
TCCT-3’, reverse 5’'-GCATGTGCTGTGCTGTCAGT-3’; $100a8 forward 5'-T 
GAGCAACCTCATTGATGTCTACC-3’, reverse 5’-ATGCCACACCCACTTTT 
ATCACC-3’; $100a9 forward 5'-GAAGAAAGAGAAGAGAAATGAAGCC-3’; 
reverse 5’-CTTTGCCATCAGCATCATACACTCC-3’; Il1b forward 5’-CAACC 
AACAAGTGATATTCTCCATG-3’, reverse 5’-GATCCACACTCTCCAGCTG 
CA-3'; Actb forward 5'-CCTCATGAAGATCCTGACCGA-3’, reverse 5’-TTT 


GATGTCACGCACGATTTC-3’. Fold change was calculated using the formula 
9 AG. — XAGWT)). 


T cell proliferation assay. Blood neutrophils from WT mice and splenic CD8* T 
cells from WT mice were isolated by magnetic column (Miltenyi). Blood neutro- 
phils from KEP mice with mammary tumours around 225 mm‘ in size were also 
used. CD8* T cells were labelled with Cell Trace CFSE following the manufacturer’s 
instructions (Invitrogen). Equal numbers of cells (2 10°) were co-cultured in a 
96-well flat bottom plate. CD3/CD28 Dynabeads (Invitrogen) were added accord- 
ing to manufacturer’s instruction, and the iNOS inhibitor, L-NMMA (Sigma), was 
added at 0.5 mM where indicated. After 48 h, T cell proliferation was evaluated on 
a BD LSRII flow cytometer using Diva software using the following antibodies: 
CD8a-PE (1:600; clone 53-6.7), CD11b-APC (1:400; clone M1/70), Ly6C-eFluor 
450 (1:400; clone HK1.4), Ly6G-AlexaFluor 700 (1:400; clone 1A8) and 7AAD 
viability marker. Data analyses used FlowJo Software version 9.7.1. Proliferation 
index was calculated using the formula (percentage of proliferated, co-cultured 
CD8* T cells)/ (perentage of proliferated CD8* T cells without co-culture) X 100, 
for each replicate experiment. 


Cytokine analysis. Multiplex quantification of cytokines and chemokines in mam- 
mary glands and tumours was performed using the premixed 24-plex Bio-Plex Pro 
Mouse Cytokine Assay (Bio-Rad) according to the manufacturer’s recommenda- 
tions. Protein lysates were prepared as previously described**. Unsupervised clus- 
tering was performed on normalized, median-centred data then converted to a 
heat-map using Genesis software. For IL-17A and G-CSF serum levels, BD Cyto- 
metric Bead Arrays were used as directed and analysed on a Cyan flow cytometer 
with Summit software (Beckman Coulter). Data analyses used FlowJo Software 
version 9.7.1. For TGF-B1, a DuoSet ELISA kit was purchased from R&D Systems 
and performed according to the manufacturer’s instructions. 

PCRarray. The spleens of three WT or KEP mammary tumour-bearing mice were 
pooled and labelled with anti-CD3 antibodies. CD3* T cells were sorted using a 
BD FACSAria II. RNA was isolated with Trizol as above. Gene expression differ- 
ences were analysed using a mouse T cell-specific PCR array from Qiagen accord- 
ing to their instructions and software. Genes exhibiting a threefold change were 
considered biologically relevant. 

Statistical analysis. Data analyses used GraphPad Prism version 7. Applied ana- 
lyses are indicated in corresponding legends. No statistical methods were used to 
predetermine sample sizes. Sample sizes were based on previous experience with 
the models”****. Differences with P < 0.05 were considered statistically significant. 
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Extended Data Figure 1 | Systemic neutrophil expansion and accumulation 
in mammary tumour-bearing KEP mice and the metastasis model. 

a, Representative images of neutrophils identified by the 7/4 antibody in 
lung sections in WT or KEP mice. Scale bar, 50 um. b, Quantification of 
neutrophil accumulation per field of view (FOV) in various organs by 
immunohistochemistry using the 7/4 antibody (n = 6 WT, 9 KEP mice). 

c, Absolute neutrophil counts in blood of WT and tumour-bearing KEP mice 
(n =4 WT, 8 KEP). d, Quantification of neutrophil accumulation in various 
organs determined by flow cytometry and gated on CD45" cells. Neutrophils 
were not detectable in WT mammary glands (n = 5 WT, 7 KEP mice). 

e, Representative images of Ly6G-stained lung sections and quantification 

of neutrophil accumulation in the metastasis model. Data were generated 
from mock-transplanted, non-tumour-bearing mice (0 mm’), or 


tumour-transplanted recipient mice killed when tumours reached the tumour 
size shown or when mice exhibited signs of respiratory distress due to 
pulmonary metastasis. For quantification in lungs with metastases, neutrophils 
residing inside metastases were excluded. T, pulmonary metastatic lesion. 
Scale bar, 100 um (n = 3, 5, 6, 6 and 3 mice for 0, 9, 25, 100 mm” and metastasis 
respectively). f, Kinetics of neutrophil accumulation in various organs of the 
metastasis model by flow cytometry after gating on CD45" cells. Recipient 
mice transplanted with KEP tumour pieces were killed at the tumour size 
shown (n = 6, 5, 6, and 7 mice for 0, 9, 25, 100 mm” respectively). g, Kinetics of 
neutrophil proportions in blood (gated on CD45" cells), before and after 
surgical removal of their primary tumour (n = 5). All data are mean + s.e.m. 
*P < 0.05, **P < 0.01, ***P < 0.001 as determined by Mann-Whitney U-test 
or Kruskal-Wallis test followed by post-hoc Dunn’s test. 
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here to avoid false-negative results since the anti-Ly6G depleting antibody may —_ Scale bar, 100 um. f, Quantification of blood vessels per field of view (FOV) in 
mask the Ly6G epitope. CD11b* Gr1"®" cells were Ly6CCCR2_, indicating _ control and neutrophil-depleted mice by anti-CD34 immunohistochemistry 
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©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


[oy 


Ly6G 
% CKIT* of 
total neutrophils 


Ly6C 


cKIT 


Cc Tumor Blood Lung 


* 
15 tg 


% cKIT* of 
total neutrophils 
=& nN w os a 
° a 3 a 
+ |e, 
* 
a 3 


0 
0 9 25 100 0 9 25 100 0 9 25 100 


Tumor size (mm?) — 


Lymph node Spleen Liver 
2 * 
‘6 & 15 
i& 
ES, 
G2 
SB 5 
2 
0 9 25 100 0 9 25 100 0 9 25 100 
Tumor size (mm?) 
d e KEP 
. * 


= 
a 
= 
jo} 
oO 


80: 


% CKIT* of 
total neutrophils 
on 2 


no 
a 
a 
2°98 
sc 
nD 
5 2 
85 
SE 
D 
®o 
” 


9 251001 2 3 4 


Tumor size Weeks 
(mm?) after 


surgery 


©2015 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 3 | Subpopulations of neutrophils in mammary 
tumour-bearing mice are immature. a, Gating strategy for identification of 
neutrophils (CD45'CD11b* Ly6G* Ly6C* F4/80° cells), cCKIT* neutrophils 
and monocytes (CD45*CD11b*Ly6G™ Ly6C* F4/80~ cells) by flow 
cytometry. Blood cells from WT and tumour-bearing KEP mice are 

shown here. b, Quantification of cKIT* neutrophil accumulation in 

various organs determined by flow cytometry after gating on 
CD45*CD11b*Ly6G*Ly6C*F4/80~ cells. CKIT* neutrophils were not 
detectable in WT mammary glands (n = 5 WT, 7 KEP; Mann-Whitney 
U-test). ¢, CKIT* neutrophil proportions in various organs of the 

metastasis model as determined by flow cytometry after gating on 

CD45‘ CD11b*Ly6G* Ly6C* F4/80° cells. Mice were killed at the tumour size 


LETTER 


shown (n = 5, 5, 5, and 8 mice for 0, 9, 25, 100 mm? respectively; Kruskal- 
Wallis test followed by post-hoc Dunn’s test). d, Kinetics of CKIT~ neutrophil 
proportions in blood (gated on CD45*CD11b*Ly6G*Ly6C* F4/80- cells), 
before and after surgical removal of the primary tumour (n = 5 per group; 
Mann-Whitney U-test). e, Representative images and quantification of 
neutrophil nuclear morphology. Ly6G* cells were isolated from blood of 
WT and tumour-bearing KEP mice then assessed by Giemsa stain. Hyper- 
segmented cells were considered mature, whereas all other cells were 
considered immature. Scale bar, 10 um. (n = 6 WT, 5 KEP mice; Mann- 
Whitney U-test). All data are mean + s.e.m. *P < 0.05, **P < 0.01, 

***P < 0,001. 
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Extended Data Figure 4 | Neutrophils influence the function and 
phenotype of CD8* T cells. a, Unsupervised hierarchical clustering of 
RNA-Seq analysis depicting 100 differentially expressed genes between 
circulating neutrophils from WT and tumour-bearing KEP mice. P value (0.05) 
was used as cutoff (n = 4 WT, 5 KEP mice). See also Extended Data Table 1 for 
top 50 genes ranked by fold change. b, Circulating neutrophils from either 
WT or tumour-bearing KEP mice were incubated with CFSE-labelled splenic 
CD8* T cells from WT mice and CD3/CD28 stimulation beads. The iNOS 
inhibitor, L_NMMA, was added where indicated. After 48h, CD8™ T cell 
proliferation was measured by flow cytometry. c, Dot plots depicting live 
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cell-gated CD8* T cell proportions in lungs of mice in control and 
neutrophil-depleted mice, killed when transplanted tumours reached 

100 mm”. d, Dot plots of effector CD8* T cell (CD62L” CD44") proportions 
in lungs of transplanted mammary tumour-bearing mice that were killed 
when tumours reached 100 mm”. e, IFN-y expression by CD8* T cells in lungs 
of transplanted mammary tumour-bearing mice that were killed when 
tumours reached 100 mm”. f, Tumour growth kinetics in neutrophil-depleted 
or combined neutrophil- and CD8* T-cell-depleted, mammary tumour- 
transplanted recipient mice, compared with control (n = 13 control, 

21 anti-Ly6G, 14 anti-Ly6G/CD8). Data are mean + s.e.m. 
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Extended Data Figure 5 | Cytokine expression levels in tumours and T cells, 
and their effects on neutrophils. a, Unsupervised clustering of cytokine 
expression analysis in WT mammary glands and KEP tumours. Protein lysates 
were prepared as previously described from whole tissue** and analysed for 
expression of various cytokines by Luminex-based assay (n = 5 per group). 
b, Protein levels of indicated cytokines in WT mammary glands and KEP 
tumours, determined by Luminex-based cytokine profiling; n.d., not detectable 
(n = 10 per group; Mann-Whitney U-test). c, d, Quantification of neutrophil 
and cKIT-expressing neutrophil accumulation in blood as determined by 
flow cytometry and gated on CD45* cells. WT (n = 4) or tumour-bearing KEP 
mice (n = 9) were treated with anti-IL-17 (n = 8) and/or recombinant G-CSF 
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(rG-CSF; n = 4) where indicated (Mann-Whitney U-test or Kruskal-Wallis 
test followed by post-hoc Dunn’s test). e, Gene expression in circulating 
neutrophils from WT control (n = 5), rG-CSF-treated WT mice (n = 4), KEP 
control (n = 10), anti-IL-17-treated (n = 6), anti-IL-17 + rG-CSF-treated 
KEP mice (n = 4; Mann-Whitney U-test or Kruskal-Wallis test followed by 
post-hoc Dunn’s test). f, Spleens of three WT mice and three KEP mice were 
pooled and CD3* T cells were isolated. These cells were analysed by a real-time 
PCR array containing 86 different genes. Gene expression changes greater 
than threefold are shown. Members of the IL-17 signalling pathway are 
depicted in blue. *P < 0.05, **P < 0.01, ***P < 0.001. All data are 

mean + S.e.m. 
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Extended Data Figure 6 | Absence of the adaptive immune system reduces _ plasma of tumour-bearing mice (n = 6 tumour, 3 plasma). c, Quantification 


metastasis. a, Graphic representation of mammary tumour latency (left) of metastatic burden in lungs of recipient WT or Rag]~/~ mice that were 
and tumour growth (right) in lymphocyte-proficient KEP;RagI*’~ and transplanted with KEP mammary tumour fragments and underwent surgical 
lymphocyte-deficient KEP;Rag] ‘~ mice (n = 30 per group left panel, 10 mice removal of the primary tumour (n = 6 WT, 4 Rag ’~ mice; **P < 0.01, 
per group right panel). b, Levels of TGF-$1 in mammary tumours and the Mann-Whitney U-test). Data are mean + s.e.m. 
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Extended Data Figure 7 | Depletion of CD4* T cells does not affect total neutrophil and cKIT* proportions in blood of control, anti-CD4- and 


systemic cytokine levels or neutrophil expansion. a, The proportion of IL- anti-yOTCR-treated tumour-bearing KEP mice. d, Quantification of total 
17A* cells among CD4” T cells in organs of WT and tumour-bearing KEP neutrophil and cKIT* neutrophil proportions in blood of control and anti- 


mice (n = 6 per group; Mann-Whitney U-test). b, Median fluorescence CD4-treated tumour-bearing KEP mice (n = 7 per group; Mann-Whitney 
intensity of IL-17A expression in circulating y and CD4* T cells from U-test). e, Serum levels of IL-17A and G-CSF in control and anti-CD4-treated 
tumour-bearing KEP mice, as determined by flow cytometry (n = 11 per tumour-bearing KEP mice (n = 10 control, 6 anti-CD4; Mann-Whitney 
group; Wilcoxon matched-pairs test). c, Representative dot plots depicting U-test). *P < 0.05, **P < 0.01. All data are mean + s.e.m. 
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Extended Data Figure 8 | 76 T cell phenotype in KEP mice and their lack of 
influence on tumour growth in the metastasis model. a, 76 T cells from lungs 
of tumour-bearing KEP mice were analysed by flow cytometry for IL-17, 
CD27, Vy1 and Vy4 expression. Two major populations of yé T cells were 
observed including IL-17'CD27° and IL-17 CD27* cells. b, Representative 
histograms of CCR6, IL-1R1, IL-23R and ROR-7t expression in IL-1 7*CD27— 
and IL-17- CD27* y6 T cell populations shown in a. ¢, I/1 8 gene expression in 
various cell populations isolated from transplanted KEP tumours. Tumours 
from three mice were pooled to form one group. CD45~ cells (which 
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include cancer cells, endothelial cells and fibroblasts), CD45 *CD11b* F4/80* 
macrophages, CD45*CD11b*Ly6G* neutrophils and CD45*CD11b~ 
lymphocytes were sorted from two pooled groups. Real-time PCR was 
performed on individual cell populations for I/1f expression. Relative 
expression among different cells is shown. d, Graphic representation of 
immune cell proportions in KEP tumours (n = 4). e, Primary tumour growth 
kinetics of control and 6 T cell-depleted tumour transplant recipient mice 
(n = 13 per group). f, Growth kinetics of primary tumours transplanted into 
Terd*/— (n= 10) and Terd’’~ mice (n = 6). All data are mean + s.e.m. 
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Extended Data Figure 9 | The v6 T cell/IL-17/neutrophil axis promotes alteration of neutrophil phenotype. These phenotypically altered neutrophils 
breast cancer metastasis. Mammary tumours evoke a systemic inflammatory produce iNOS that suppresses the activity of anti-tumour CD8" T cells. As a 
cascade that is initiated by IL-1B production. Tumour-derived IL-1B result of this systemic inflammatory cascade, the chance that disseminated 
activates y6 T cells to produce IL-17. Increased systemic IL-17 levels lead to cancer cells can establish metastases in distant organs is maximized. 


upregulation of G-CSF, which subsequently causes neutrophil expansion and 
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Extended Data Table 1 


Gene ID 


Nos2 
Car4 

Lipg 
Gm11430 
Gm6551 
Stfa3 
Pvrl2 
Gm16748 
Prok2 
Esm1 
Ano2 
Saa7 

Nov 
Gpr15 
Ggt1 

Cish 
Stfa2 
Gm14028 
Stfa1 
Cfhr1 
Ms4a3 

Kit 

Jph3 
Cnnm2 
Gnb5 
Alox12 
Slfn14-ps 
Gm6634 
Syt13 
Tsc22d1 
Gm10419 
Sh3bgrl2 
Fhli1 
Trpc6 
Ctla2a 
Csgalnact1 
Gng11 
2610109HO7Rik 
Nrgn 
Gm11274 
Peg10 
Angpt1 
Plip1 

Dlg2 

Sytl4 

Mpl 

Mras 

Gp6 
Cd226 
Bean1 


Gene name 


nitric oxide synthase 2, inducible 

carbonic anhydrase 4 

lipase, endothelial 

predicted gene 11430 

predicted gene 6551 

stefin A3 

poliovirus receptor-related 2 

predicted gene, 16748 

prokineticin 2 

endothelial cell-specific molecule 1 

anoctamin 2 

serum amyloid A 1 

nephroblastoma overexpressed gene 

G protein-coupled receptor 15 
gamma-glutamyltransferase 1 

cytokine inducible SH2-containing protein 

stefin A2 

predicted gene 14028 

stefin A1 

complement factor H-related 1 

membrane-spanning 4-domains, subfamily A, member 3 
kit oncogene 

junctophilin 3 

cyclin M2 

guanine nucleotide binding protein (G protein), beta 5 
arachidonate 12-lipoxygenase 

schlafen family member 14, pseudogene 

predicted gene 6634 

synaptotagmin XiIll 

TSC22 domain family, member 1 

predicted gene 10419 

SH3 domain binding glutamic acid-rich protein like 2 
four and a half LIM domains 1 

transient receptor potential cation channel, subfamily C, member 6 
cytotoxic T lymphocyte-associated protein 2 alpha 
chondroitin sulfate N-acetylgalactosaminyltransferase 1 
guanine nucleotide binding protein (G protein), gamma 11 
RIKEN cDNA 2610109H07 gene 

neurogranin 

predicted gene 11274 

paternally expressed 10 

angiopoietin 1 

proteolipid protein (myelin) 1 

discs, large homolog 2 (Drosophila) 
synaptotagmin-like 4 

myeloproliferative leukemia virus oncogene 

muscle and microspikes RAS 

glycoprotein 6 (platelet) 

CD226 antigen 

brain expressed, associated with Nedd4, 1 


The top 50 most differentially expressed genes between neutrophils from WT and KEP mice 


Ensembl gene ID 


ENSMUSG00000020826 
ENSMUSG00000000805 
ENSMUSG00000053846 
ENSMUSG00000080927 
ENSMUSG00000078100 
ENSMUSG00000054905 
ENSMUSG00000062300 
ENSMUSG00000085308 
ENSMUSG00000030069 
ENSMUSG00000042379 
ENSMUSG000000381 15 
ENSMUSG00000074115 
ENSMUSG00000037362 
ENSMUSG00000047293 
ENSMUSG00000006345 
ENSMUSG00000032578 
ENSMUSG00000022902 
ENSMUSG00000082339 
ENSMUSG00000071562 
ENSMUSG00000057037 
ENSMUSG00000024681 
ENSMUSG00000005672 
ENSMUSG00000025318 
ENSMUSG00000064105 
ENSMUSG00000032192 
ENSMUSG00000000320 
ENSMUSG00000082101 
ENSMUSG00000086538 
ENSMUSG00000027220 
ENSMUSG00000022010 
ENSMUSG00000072769 
ENSMUSG00000032261 
ENSMUSG00000023092 
ENSMUSG00000031997 
ENSMUSG00000044258 
ENSMUSG00000036356 
ENSMUSG00000032766 
ENSMUSG00000029005 
ENSMUSG00000053310 
ENSMUSG00000085331 
ENSMUSG00000092035 
ENSMUSG00000022309 
ENSMUSG00000031425 
ENSMUSG00000052572 
ENSMUSG00000031255 
ENSMUSG00000006389 
ENSMUSG00000032470 
ENSMUSG00000078810 
ENSMUSG00000034028 
ENSMUSG00000031872 


fold 
change 
31.5 
25.5 
24.6 
16.7 
14.2 
14.1 
13.8 


p 
value 
0.345 
0.111 
0.128 
0.292 
0.039 
0.047 
0.143 
0.087 
0.024 
0.010 
0.138 
0.192 
0.295 
0.204 
0.076 
0.217 
0.020 
0.036 
0.087 
0.029 
0.177 
0.085 
0.076 
0.128 
0.004 
0.103 
0.049 
0.139 
0.213 
0.107 
0.231 
0.117 
0.131 
0.076 
0.107 
0.156 
0.123 
0.195 
0.087 
0.101 
0.177 
0.168 
0.132 
0.080 
0.138 
0.197 
0.053 
0.215 
0.134 
0.053 
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MET is required for the recruitment of anti-tumoural 


neutrophils 


Veronica Finisguerra’’, Giusy Di Conza!?, Mario Di Matteo’, Jens Serneels’?, Sandra Costa’?**, A. A. Roger Thompson’, 
Els Wauters®”*, Sarah Walmsley”, Hans Prenen’, Zvi Granot'®, Andrea Casazza!? & Massimiliano Mazzone’? 


Mutations or amplification of the MET proto-oncogene are 
involved in the pathogenesis of several tumours’ *, which rely on 
the constitutive engagement of this pathway for their growth and 
survival’. However, MET is expressed not only by cancer cells but 
also by tumour-associated stromal cells, although its precise role in 
this compartment is not well characterized®''. Here we show that 
MET is required for neutrophil chemoattraction and cytotoxicity 
in response to its ligand hepatocyte growth factor (HGF). Met 
deletion in mouse neutrophils enhances tumour growth and meta- 
stasis. This phenotype correlates with reduced neutrophil infiltra- 
tion to both the primary tumour and metastatic sites. Similarly, 
Met is necessary for neutrophil transudation during colitis, skin 
rash or peritonitis. Mechanistically, Met is induced by tumour- 
derived tumour necrosis factor (TNF)-a or other inflammatory 
stimuli in both mouse and human neutrophils. This induction is 
instrumental for neutrophil transmigration across an activated 
endothelium and for inducible nitric oxide synthase production 
upon HGF stimulation. Consequently, HGF/MET-dependent 
nitric oxide release by neutrophils promotes cancer cell killing, 
which abates tumour growth and metastasis. After systemic 
administration of a MET kinase inhibitor, we prove that the thera- 
peutic benefit of MET targeting in cancer cells is partly countered 
by the pro-tumoural effect arising from MET blockade in neutro- 
phils. Our work identifies an unprecedented role of MET in neu- 
trophils, suggests a potential ‘Achilles’ heel’ of MET-targeted 
therapies in cancer, and supports the rationale for evaluating 
anti-MET drugs in certain inflammatory diseases. 

To ensure Met deletion in the immune system only, we took advant- 
age of the Tie2:Cre deleter that excises floxed genes in both bone 
marrow and endothelial cells'”, and we reconstituted lethally irradiated 
C57BL/6 wild-type (WT) mice with bone marrow cells from 
Tie2;Met’’”' (WT) or Tie2;Met™" (knockout (KO)) mice (Extended 
Data Fig. 2a), producing WI—WT or KO-WT mice, respectively. 
Compared to WI->WT controls, both growth and metastatic burden 
of subcutaneous LLC lung carcinomas were boosted in KO—>WT mice 
(Fig. la-g), with reduced tumour apoptosis and necrosis, increased 
proliferation, but comparable vessel parameters and hypoxia 
(Extended Data Fig. 2b-r). A similar induction in tumour growth 
and metastasis was observed in non-irradiated KO versus WT mice 
(Extended Data Fig. 2s—u), but tumour growth, as well as the vascular 
features, were comparable in WI->WT versus WT->KO chimaeras, 
displaying Met deletion in endothelial cells only (Extended Data Fig. 
20-1, v). Thus, Met deletion in immune cells favours cancer growth 
and metastasis. 

Blood counts and percentage of circulating blood cell subsets did not 
change in WI—WT and KO—WT mice, either at baseline or upon 


LLC tumour engraftment (Extended Data Fig. 3a-e and Extended 
Data Tables 1,2). Notably, KO—+WT mice displayed reduced numbers 
of tumour-infiltrating CD45" leukocytes and, among all the different 
subpopulations, only Ly6G* tumour-associated neutrophils (TANs) 
were strongly reduced in KO—WT versus WI—WT mice at any time 
point (Fig. lh-j and Extended Data Fig. 3f-k). Similarly, lungs from 
KO-—WT tumour-bearing mice contained fewer CD45* leukocytes 
with decreased Ly6G* neutrophil infiltration, whereas the number of 
F4/80 macrophages was comparable (Fig. 1k-m and Extended Data 
Fig. 31, m). Furthermore, reconstitution of Met in neutrophils only’ 
(Extended Data Fig. 4a, b) was sufficient to rescue their recruitment 
and to hinder tumour growth and metastasis in KO—-WT mice 
(Fig. In-q). Vice versa, restricted deletion of Met in neutrophils 
(Mrp8;Me#t™") by the neutrophil-specific Mrp8:Cre line! (Extended 
Data Fig. 4c, d) led to enhanced tumour growth and dissemination, 
and marked TAN reduction, as in KO—WT chimaeras (Fig. lr-u and 
Extended Data Fig. 4e). These results indicate that MET is required for 
recruiting anti-tumoural neutrophils. 

To extend our findings to other tumour types, we proved that Met 
deletion in the haematopoietic system increased the growth of (1) 
orthotopic T241 fibrosarcomas and B16F10 melanomas, (2) spontan- 
eous mammary tumours in MMTV-PyMT™ transgenic mice, (3) 
H-Ras(G12V)- and c-Myc-driven hepatocellular carcinomas (HCCs), 
and (4) chemically induced colorectal cancers (CRCs) (Fig. 2a-j and 
Extended Data Fig. 5a, b). Furthermore, lung colonization of B16F10 
melanoma cells (from either the primary tumour or after cancer 
cell intravenous injection) and of MMTV-PyMT™ breast tumours 
was boosted in Met KO chimaeras (Fig. 2k, 1 and Extended Data 
Fig. 5c). In all these tumour types, Met KO TANs were fewer than 
WT TANSs (Fig. 2m and Extended Data Fig. 5d, e). Interestingly, 
during chronic bowel inflammation (preceding CRC formation), neu- 
trophil but not macrophage infiltration of the colon was also abated by 
haematopoietic Met deletion, but this reduction did not impinge on 
colitis severity (Extended Data Fig. 5f-i). B16F10 melanomas and 
HCCs displayed enhanced tumour growth (and metastasization) as 
well as reduced TAN infiltration in Mrp8;Met” “ versus Mrp8;Met"” WE 
mice (Fig. 2n-q). Conversely, orthotopic Panc02 carcinomas grew 
and metastasized similarly in both WI—WT and KO>WT mice, 
and TAN infiltration did not change (Extended Data Fig. 5j-l). 
However, these tumours produced little HGF compared to LLC 
tumours (Extended Data Fig. 5m, n). In general, plasma and intratu- 
mour HGF did not differ between genotypes (Extended Data 
Fig. 50, p). In sum, Met deficiency in neutrophils promotes the pro- 
gression of different (HGF-secreting) tumours. 

Systemic treatment of WT mice carrying B16F10 melanomas (which 
are dependent on MET”) with three different MET tyrosine-kinase 
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Figure 1 | Met deficiency inhibits neutrophil 
recruitment to tumour and metastatic site. 

a-g, LLC tumour growth (a), weight (b), lung 
macrometastases (c), metastatic area 

(d), representative images of haematoxylin and 
eosin (H&E)-stained lung sections (e, f), and 
metastatic index (g) in WI-—WT and KO>WT 
chimaeras. Data combine three independent 
experiments; total mice: WI WT, 23; 

KO-WT, 26. h-m, Neutrophil quantification and 
representative images on Ly6G-stained LLC 
tumour sections (h-j) or on lung sections from 
tumour-free and tumour-bearing mice 

(k) represented in I, m. Data in h are representative 
of four independent experiments (6 mice per 
condition per experiment). Data in k combine 
three independent experiments; total mice: 
tumour-free, 10 per condition; tumour-bearing, 15 
per condition. n-q, LLC tumour growth 

(n), tumour weight (0), lung macrometastases 
(p), and TAN quantification (q) in WI—WT and 
KO—WT control chimaeras (Mrp8:Empty) or 
upon neutrophil-specific Met reconstitution 
(Mrp8:Met). Data combine two independent 
experiments; total mice, 10 per condition. r-u, LLC 
tumour growth (r), tumour weight (s), lung 


Lung metastatic size (LLC) 
15 = 


o 


a 


— 


0 
WT>WT = KO+WT 


* 


macrometastases (t), and TAN quantification 

(u) upon neutrophil-specific Met deletion 
(Mrp8;Met™") and in controls (Mrp8;Met"”™’), 
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inhibitors (PF-04217903, INCB28060 and JNJ-38877605), strongly 


reduced TAN recruitment (Extended Data Fig. 5q). We then com- 
pared MET silencing in cancer cells versus systemic MET inhibition. 
Systemic administration of PF-04217903 decreased the weight and 
volume of B16F10 melanomas by 36% and 54%, respectively. Instead, 
MET knockdown in cancer cells only, led to 58% and 75% inhibition 
of tumour growth and volume (Fig. 2r, s and Extended Data Fig. 5r). 
However, the combination of these two strategies was not synergic 
but dampened tumour inhibition to the same level as observed with 
PF-04217903 alone (Fig. 2r, s). TAN inhibition by PF-04217903 was 
comparable in both Met-silenced and scrambled B16F10 melanomas 
(Fig. 2t). These data unveil how the therapeutic benefit of systemic 
MET inhibition is partly blunted by the blockade of anti-tumoural 
neutrophils. 

To date, MET expression in neutrophils has been poorly documen- 
ted’'. We thus measured MET levels in circulating or tumour-infilt- 
rating neutrophils. Circulating Ly6G* cells from healthy mice 
expressed low MET, but these levels were increased in circulating 
neutrophils from LLC-tumour-bearing mice and even further in 
TANSs (Fig. 3a, b and Extended Data Fig. 6a). Similarly, neutrophils 
isolated from non-small-cell lung tumours displayed much higher 
MET levels than in the healthy tissue (Fig. 3c). 

Co-culture with interleukin (IL)-la% pre-activated endothelium 
as well as stimulation with tumour- or cancer-cell-conditioned 
medium (TCM or CCM, respectively) promoted MET expression 
in both mouse and human neutrophils (Fig. 3d-g). In a biased 
approach’*"’’, we found that TNF-« and lipopolysaccharide (LPS) 
(but not IL-l, HGF, or hypoxia) induced MET expression in 
both mouse and human neutrophils (Fig. 3h and Extended Data 
Fig. 6b-e; data not shown). TNF-o-mediated MET induction 
required TNFR1 and subsequent nuclear factor (NF)-«B activation 
(Fig. 3i-k). TNF-a alone was not able to trigger either MET 
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phosphorylation or HGF release in neutrophils (Extended Data 
Fig. 6f-h). 

Silencing of endothelial-cell-borne TNF-« (which is 250-fold 
increased upon stimulation with IL-10; Extended Data Fig. 6i), knock- 
out of neutrophil-borne TNFR1 (but not of TNFR2), and pharmaco- 
logical blockade of TNF-« with the TNF-o-trap Enbrel, prevented 
MET induction in mouse or human neutrophils upon co-culture with 
activated endothelial cells or exposure to TCM/CCM (Fig. 31 and 
Extended Data Fig. 6j-m). Finally, systemic administration of Enbrel 
in LLC-tumour-bearing mice diminished MET expression in neutro- 
phils as well, resulting in their reduced recruitment to the tumour 
(Fig. 3m, n). Although MET is scarcely expressed in naive neutrophils, 
it is strongly induced by inflammatory stimuli. 

Mechanistically, impaired TAN accumulation after Met inactiva- 
tion was not due to cell death as assessed in LLC tumours and in 
culture, both at baseline and under LPS stimulation, with or without 
HGF (Extended Data Fig. 7a—-e), but rather to a defect in neutrophil 
recruitment from the blood. Indeed, also in the case of acute inflam- 
mation, Met KO neutrophils displayed reduced exudation to the skin 
or to the peritoneal cavity (Fig. 4a-d and Extended Data Fig. 8a, b). 
Macrophage and lymphocyte recruitment did not change (Fig. 4d and 
Extended Data Fig. 8c, d). Vice versa, recombinant HGF recruited WT 
neutrophils inside subcutaneous air pouches with similar efficacy to 
the neutrophil chemoattractant CXCL1 (Fig. 4e and Extended Data 
Fig. 8e). In contrast, Met KO neutrophils did not migrate towards 
HGF, although their response to CXCL1 was preserved (Fig. 4e and 
Extended Data Fig. 8e). Mirroring this approach, an anti- HGF block- 
ing antibody’* prevented neutrophil infiltration to tumours and 
inflamed skin (Fig. 4f). 

We then tested the relevance of MET for neutrophil migration. 
Stimulation of WT neutrophils with HGF promoted their adhesion 
and chemotaxis through an activated endothelium whereas Met KO 
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Figure 2 | Met deficiency in haematopoietic cells fosters progression of 
several tumour types. a-c, Growth of T241 fibrosarcomas (a), B16F10 
melanomas (b) and PyMT-driven breast tumours (c). Data in a, b combine two 
independent experiments; total mice, 14 per condition (a); 8 per condition 
(b). Data in c combine three independent experiments; total mice: 
WT-PyMT, 13; KO—PyMT, 16. d-g, Liver weight (d), nodules (e), and 
images (f, g) after H-Ras(G12V)/c-Myc-driven HCC (n = 4 mice per 
condition). h-j, Quantification (h) on H&E-stained bowel sections (i, j) of 
colon adenomas (yellow arrowheads) and carcinomas (red arrowheads) after 
administration of azoxymethane (AOM) and dextran sodium sulfate (DSS). 
Data combine two independent experiments; total mice, 10 per condition. 

k, Spontaneous lung metastases from B16F10 tumours or lung colonization 
after B16F10 intravenous injection. Data combine two independent 
experiments; total mice, 8 per condition. 1, Lung macrometastases from PyMT 
tumours. Data combine three independent experiments; total mice: 


neutrophils (displaying an 85% reduction in MET protein levels com- 
pared to WT; Extended Data Fig. 2a) completely lost this response 
(Fig. 4g, h and Extended Data Fig. 8f). In line with this, TCM (contain- 
ing 2.6+0.3ngml_' HGF) promoted transendothelial migration of 
WT neutrophils, but its effect was 43% lower on Met KO neutrophils 
(Fig. 4i). Upon HGF neutralization, WT neutrophils responded to 
TCM only partially, as did Met KO neutrophils (Fig. 4i). Neither 
HGF nor TCM influenced neutrophil behaviour on non-activated 
endothelial cells or bare membranes (Extended Data Fig. 8f-h). 
Hence, HGF-mediated MET activation is required for neutrophil 
transendothelial migration to the inflammatory site. 


WT-PyMT, 13; KO-PyMT, 16. m, TAN quantification in T241, B16F10, 
PyMT, HCC and CRC tumour tissues. Total mice: T241 and CRC, 10 per 
condition; B16F10, 8 per condition, combining two experiments; 
WT-PyMT, 13; KO->PyMT, 16, combining three experiments; HCC, 4 per 
condition (one experiment). n, 0, B16F10 tumour growth (n), spontaneous 
metastases and lung colonization (0) in Mrp8;Met"” Wand Mrp8;Met" fl mice. 
Data combine two independent experiments; total mice: Mrp8;Met’”, 12; 
Mrp8;Met™", 8. p, Liver nodules in Mrp8;Met””” and Mrp8;Me#™" HCC- 
bearing mice. Total mice, 7 per condition. q, TAN quantification in B16F10 
tumours (total mice, 8 per condition) or HCCs (total mice, 7 per condition). 
r-t, Tumour growth (r), weight (s) and TANs in short hairpin RNA (shRNA) 
Met-silenced (shMet) and scrambled B16F10 tumours after PF-04217903 or 
vehicle treatment. Data combine two independent experiments (total mice, 11 
per condition for scramble; 14 per condition for shMet). *P < 0.05. Scale bars: 
0.5 cm (f, g); 200 um (i, j). Graphs show mean = s.e.m. 


Once migrated inside the tumour, N1 or N2 neutrophils can inhibit 
or favour tumour progression, respectively’. Among the N1 and N2 
genes, only the expression of the N1 marker inducible nitric oxide 
synthase (Nos2, also known as iNos) was lower in Met KO versus 
WT TANs but similar in macrophages (Fig. 4) and Extended Data 
Fig. 8i). Compared to WI—WT mice, tumours harvested from 
KO—WT mice displayed reduced nitric oxide (NO) production and 
3-nitrotyrosine (3NT) formation, a sign of NO-mediated cell damage 
(Fig. 4k-n). In vitro, Met KO TANs had lower cancer-cell-killing capa- 
city than WT TANs; iNOS inhibition by N°-monomethyl-L-arginine 
(L-NMMA) blunted this difference (Fig. 40 and Extended Data Fig. 8)). 
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Figure 3 | Met expression in neutrophils is induced by tumour-derived 
soluble factors. a, b, Quantitative polymerase chain reaction with reverse 
transcription (qRT-PCR) (a) and fluorescence-activated cell sorting (FACS) 
(b) analysis for MET in blood neutrophils from tumour (TM)-free or LLC- 
tumour-bearing mice and in TANs. c, (RT-PCR for MET in human 
neutrophils from lung cancer versus healthy tissue. n = 4 patients. d, e, MET 
expression by qRT-PCR (d) and western blot (e) in circulating neutrophils 
from tumour-free mice after co-culture with unstimulated (HUVEC/NS) or IL- 
1a-pre-stimulated (HUVEC/IL) human umbilical vein endothelial cells 
(HUVECs), or after stimulation with TCM or CCM. f, g, RT-PCR (f) or 
western blot (g) for MET in circulating human neutrophils after stimulation 
with A549-CCM. h, Western blot for MET in mouse and human neutrophils 
after LPS or TNF-o stimulation. i, RT-PCR for Met in WT, TNFR1 KO or 


TNER2 KO mouse neutrophils after TNF-« stimulation. j, k, qRT-PCR (j) and 
western blot (k) for MET in mouse neutrophils after TNF-« stimulation with or 
without NF-«B inhibitor. 1, qRT-PCR for Met in mouse neutrophils co- 
cultured with HUVEC/IL or stimulated with TCM in the presence or absence of 
Enbrel. m, n, FACS for MET in TANs (m) and immunohistochemistry for 
Ly6G (n) in LLC tumours after Enbrel. Data combine two independent 
experiments; total mice, 5 per condition. All data in a, b, d, f, i, j, Lare 
representative of two independent experiments using four biological replicates 
per condition per experiment. All western blots were repeated three times on 
independent biological replicates. Full western blot images are shown in 
Supplementary Fig. 1. Loading control in e, h displays tubulin or actin 
according to Supplementary Fig. 1. *P < 0.05; +P < 0.05 versus mock (i, 1) or 
versus untreated (j). Graphs show mean + s.e.m. 
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Figure 4| MET is required for neutrophil transendothelial migration and 
cytotoxicity. a—c, Neutrophil quantification (a) on 12-O- 
tetradecanoylphorbol-13-acetate (TPA)-painted ear skin (b, c). Data combine 
two independent experiments; total mice: 10 per condition for vehicle; 14 per 
condition for TPA. d, FACS analysis for Ly6G* neutrophils or F4/80* 
macrophages on peritoneal lavages after zymosan-induced peritonitis. Data are 
representative of two independent experiments using 4 mice per condition per 
experiment. e, FACS analysis for neutrophil recruitment towards HGF or 
CXCLI in air pouch assays. Data combine three independent experiments; total 
mice: 10 per condition. f, Ly6G infiltration in LLC tumours or in TPA-painted 
ear skin after anti- HGF. Data combine two independent experiments; total 
mice: IgG, 11; anti- HGF, 6. g-i, Neutrophil adhesion to HUVEC/IL (g) and 
transendothelial migration in response to HGF (h) or TCM with or without 
anti-HGF (i). Data in g-i are representative of three independent experiments 
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using three biological replicates per condition per experiment. WBCs, white 
blood cells. j, k, (RT-PCR for Nos2 in LLC-tumour-associated neutrophils or 
macrophages (j) and tumour-derived NO production (k). I-n, Quantification 
(1) and representative images (m, n) of 3NT and Ly6G co-stained LLC tumour 
sections. DAPI, 4’,6-diamidino-2-phenylindole. Data in j-n combine two 
independent experiments; total mice: 8 per condition. 0, TAN cytotoxicity 
against LLC cells with or without L-NMMA. Data are representative of four 
independent experiments using three biological replicates per condition per 
experiment. p-r, LLC tumour growth (p), weight (q) and TANs (r) after 
neutrophil-depleting anti-Ly6G treatment. Data combine two independent 
experiments; total mice: 16 per condition. *P < 0.05; +P < 0.05 versus PBS 
(e), versus mock (g-i), and versus WI—WT untreated (0). Scale bars: 100 um 
(b, c); 20 um (m, n). Graphs show mean = s.e.m. 
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HGF-stimulated WT but not Met KO neutrophils displayed enhanced 
NO release and cytotoxicity, which was abated by L-NMMA 
(Extended Data Fig. 8k, 1). 

We then hypothesized that HGF/MET pathway is key for anti- 
tumoural neutrophils only. Neutrophil depletion in WI-—WT chi- 
maeras did not affect LLC tumour growth, implying that in this 
tumour model anti-tumoural and pro-tumoural neutrophils are in 
balance (Fig. 4p-r). The same treatment in KO—WT mice reduced 
tumour growth by 34% (Fig. 4p-r), indicating that Met deletion inhi- 
bits recruitment and activation of cytotoxic, but not of pro-tumoural 
neutrophils, which are instead blocked by the anti-Ly6G antibody 
(Fig. 4r and Extended Data Fig. 8m). 

In sum, we demonstrate that MET is induced by inflammatory 
stimuli. This receptor is then required for neutrophil extravasation 
to inflamed tissues. Extravasated neutrophils respond to HGF by pro- 
ducing cytotoxic nitric oxide (Extended Data Fig. 1). All these steps 
restrain non-specific immune reactions to the inflammatory site, pre- 
venting damage of healthy organs. 

These findings highlight a double-edged role of MET in cancer: on 
the one hand, in MET-addicted tumours, this pathway is vital for the 
cell cycle and survival’; on the other hand, it promotes anti-tumori- 
genic activities in neutrophils. Thus, alternative approaches targeting 
MET on cancer cells only, and trials guided by new patient selection 
strategies will be important to maximize the efficacy of MET inhibition 
in oncological diseases**”°. 

Finally, given the fact that MET-inhibiting drugs are not associated 
with overt toxicity’, MET-targeted therapies might ameliorate the 
symptoms of inflammatory disorders in which neutrophils are import- 
ant for disease pathogenesis”. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Animals. The Met floxed mice, a gift from S. S. Thorgeirsson, were backcrossed in 
a C57BL/6 background. The Tie2:Cre and MMTV-PyMT transgenic lines were 
obtained from our mouse facility. The Mrp8:Cre mice were a gift from C. A. Lowell 
and P. Bruhns'*”***. C57BL/6 mice were purchased from Harlan. TNFR1 KO 
mice and TNFR2 KO mice were a gift from C. Libert. The Met floxed mice were 
intercrossed for at least two generations with Tie2:Cre or Mrp8:Cre in order to 
obtain Met Cre-negative (WT) or Met Cre-positive (KO) littermates for the 
specific promoter. All the experimental procedures were approved by the 
Institutional Animal Care and Research Advisory Committee of the KU 
Leuven. In all the experiments, mice were gender and age matched (within an 
age range between 6 and 10 weeks). 

Cell lines. Murine Lewis lung carcinoma cells (LLC), melanoma B16F10, and 
human non-small-cell lung carcinoma A549 cells were obtained from the 
American Type Culture Collection (ATCC); the murine pancreatic tumour cell 
line PancO2 and the murine fibrosarcoma cell line T241 were gifts from U. 
Cavallaro and L. Claesson-Welsh, respectively. LLC, B16F10, A549 and T241 cells 
were cultured in DMEM (Gibco) supplemented with 2 mM glutamine, 100 units 
ml penicillin, 100 jg ml’ streptomycin and containing 10% FBS (DMEM 10% 
FBS). Panc02 cells were cultured in RPMI (Gibco) supplemented with 2mM 
glutamine, 100 units ml~’ penicillin, 100 :g ml’ streptomycin and containing 
10% FBS. All these murine tumour cell lines are syngeneic in a C57BL/6 back- 
ground, allowing implantation in Met conditional knockout mice or chimaeras. 
HUVECs were isolated from human umbilical cords and maintained in M199 
(Invitrogen) supplemented with 20% FBS, 2mM glutamine, 100 units ml’ peni- 
cillin, 100 pg ml! streptomycin, 0.15% heparin, 20 pg ml | ECGS (M199 com- 
plete). 0.1% pork gelatin was used to favour the adhesion of HUVECs to the flask 
bottom. All cells were maintained in a humidified incubator in 5% CO, and 95% 
air at 37 °C. Three or four different shRNA lentiviral vectors (Sigma) were used to 
silence Met in LLC, B16F10 or T241 cells (LLC shMet, B16F10 shMet or T241 
shMet; see later), or to silence TNFA in HUVECs (HUVEC shTNFA). Scramble 
lentiviral vectors were used as control. Transduced cells were selected with 8 1g 
ml! puromycin. All cancer cell lines and primary HUVECs underwent myco- 
plasma testing before their use. Negative mycoplasma contamination status was 
verified using LookOut Mycoplasma PCR Kit (Sigma) and MycoAlert 
Mycoplasma Detection Kit plus Assay Control (Lonza). Panc02 and T241 cells 
were both authenticated by Idexx Bioresearch. All cells were passaged in the 
laboratory for no longer than 6 months after receipt. Supplementary Table 1 lists 
the sequences of all the shRNA constructs used in this study. 

Bone marrow transplantation and blood cell count. Recipient 6-week-old 
female mice were lethally irradiated (9.5 Gy) and then intravenously injected with 
10’ bone marrow cells from WT or KO mice 16h later. Experiments were initiated 
5 weeks after bone marrow reconstitution. Blood cell count was determined using 
a haemocytometer on peripheral blood collected by retro-orbital bleeding. 
Haematopoietic stem/progenitor cell transduction. For MET overexpression or 
reconstitution in WT or Met KO neutrophils, respectively, lineage-negative 
haematopoietic stem/progenitor cells (HSPCs) were enriched with the mouse 
haematopoietic progenitor enrichment kit (Stem Cell Technologies) and checked 
for purity by FACS according to the manufacturer’s protocol. 1 X 10° cells ml“! 
were pre-stimulated for 5h with stem span serum-free medium (Stem Cell 
Technologies) supplemented with IL-3 (20ng ml‘), SCF (100 ng ml‘), TPO 
(100 ng ml © ') and ELT-3L (100 ngml~ 1) (Promega), and transduced with 1 x 10° 
transducing units (TU) ml’ ofa lentiviral vector expressing mouse Met under the 
promoter of the human gene S100A8 (Mrp8:Met), which has been engineered for 
neutrophil-specific transcriptional targeting, or an empty vector (Mrp8:Empty) as 
control. Briefly, the promoter driving Met expression in neutrophils only corre- 
sponds to a 3.6 kb DNA fragment encompassing the natural 5’ and 3’ regulatory 
regions but deleted of its exon coding sequences. Hence, Met is under the control 
of the 5’ and 3’ untranslated regions of the human S$100A8 gene and other prox- 
imal cis regulatory sequences present in the surrogate DNA fragment’**?*. Ten 
hours after the first viral transduction, cells received a second round of their 
respective lentiviral vector; 7h later 1 X 10° cells were injected via tail vein in 
lethally irradiated C57BL/6 recipient mice. A fraction of transduced HSPCs were 
cultured and collected after 9 days to measure the number of integrated vector 
copies per cell genome (vector copy number (VCN)) by qPCR using custom 
TaqMan assays specific for HIV gag sequences (Applied Biosystems), as previously 
described’’. Standard curves for HIV gag (contained by both Mrp8:Empty and 
Mrp8:Met lentiviral vectors) were obtained from the corresponding plasmids. 
Fifty nanograms of genomic DNA from each sample were subjected to qPCR in 
quadruplicate using an ABI Prism 7500 Fast Real-Time PCR System (Applied 
Biosystems). VCN was determined comparing the amplification signal on the 
genomic DNA with the standard curve consisting of serial dilutions over a 6 log 
range (slope, approximately —3.3; intercept, approximately 35; efficiency percent- 


age, approximately 100). Average copies per cell genome were calculated taking 
into account that one murine diploid genome = 5.92 pg. The results of this ana- 
lysis are shown in Supplementary Table 2. 

Tumour models. 2 x 10° LLC or 1 X 10° B16F10 cells were injected subcuta- 
neously while 2 X 10° T241 cells were injected intradermally in a volume of 
200 pl PBS. Tumour volumes were measured three times a week with a calliper. 
At end stage, tumours were weighed and collected for histological examination or 
FACS analysis. MMTV-PyMT™ spontaneous breast tumours were measured 10 
weeks after birth (6 weeks after bone marrow transplantation), three times a week, 
and mice were killed at week 16. Lung metastases were contrasted by intratracheal 
injection of a 15% India ink solution, by H&E staining on lung paraffin sections, or 
detected by qRT-PCR for the melanoma-specific gene S100B in the models invol- 
ving B16F10 cells. For orthotopic pancreatic tumour growth, mice were anaesthe- 
tized with isoflurane, the stomach exteriorized via abdominal midline incision, 
and 1 X 10° Panc02 tumour cells in 30 pl PBS were injected into the head of the 
pancreas using a 29-gauge needle. A successful intrapancreatic injection of tumour 
cells was identified by the appearance of a fluid bleb without intraperitoneal 
leakage. Mice displaying peritoneal leakage were immediately killed and excluded 
from the analysis. At day 12, primary tumours were removed and weighed. 
Enlarged lymph nodes were counted under a stereoscopic microscope. For the 
chemically induced colorectal cancer model, body-weight-matched mice received 
one intraperitoneal injection of 10 mg kg ' of azoxymethane (AOM) followed by 
3 cycles of 7 days of 1.5% (cycle I) or 1.7% (cycle II-III) dextran sodium sulphate 
(DSS) in drinking water, starting from the day of AOM injection”. After 160 days, 
the colon was collected and prepared for histological evaluation with the ‘Swiss 
roll’ technique”. For the oncogene-driven hepatocellular carcinoma model, mice 
received a 1:1 molar ratio (3 jig total DNA) of piggyBac transposons encoding for 
c-Myc and H-Ras@" oncogenes, driven by the PGK promoter, together with the 
hyperactive piggyBac transposase-encoding plasmid*®. DNA solutions containing 
transposon/transposase plasmids were diluted in 2 ml of Ringer’s solution and 
hydrodynamically delivered in 7 s through the tail vein. Mice were killed 24 weeks 
after the hydrodynamic injection. 

Lung colonization assay. In the experimental metastasis assays, 0.5 X 10° B16F10 
cells were injected in the tail vein and lungs were collected after 12 days. To 
quantify pulmonary seeding, lungs were homogenized in Trizol (Ambion) and 
RNA was purified with the RNeasy Mini kit (Ambion) according to manufac- 
turer’s instructions. The expression of the melanocyte-specific gene S100B was 
measured as a readout of lung colonization by qRT-PCR following reverse tran- 
scription to complementary DNA with the QuantiTect Reverse Transcription kit 
(Qiagen). 

Mice treatments. To induce chronic colitis, mice received 3 cycles of 7 days of 
1.5% (cycle I) or 1.7% (cycle II-III) DSS in drinking water; 2 weeks after the last 
DSS cycle, the colon was collected and prepared for histological evaluation as 
described earlier”. For in vivo MET inhibition, B16F10 tumour-bearing mice 
received 40 mgkg ' PF-04217903 (AbMole Bioscience) or the corresponding 
vehicle (0.5% methylcellulose in saline) via oral gavage every day once a day 
starting from day 2 after tumour injection and twice a day from day 11 until the 
end of the experiment; alternatively mice were treated with 50mgkg ' 
INCB28060 (AbMole Bioscience) or 50mg kg! JNJ-38877605 (Selleckchem). 
For TNF-« inhibition in vivo, LLC tumour-bearing mice were randomized for 
comparable tumour volumes at the end stage and intraperitoneally (i-p.) injected 
with 10mgkg ’ of Enbrel or human IgG in PBS, 3 and 1 day before tumour 
collection. For HGF inhibition in vivo, LLC tumour-bearing mice were rando- 
mized when average tumour volume was 300 mm’ and ip. injected with 0.2 mg 
per mouse of anti-HGF blocking antibody (AF-2207, R&D"*") or goat IgG in PBS. 
Tumours were collected 20h later for histological analysis; 12-O-tetradecanoyl- 
phorbol-13-acetate (TPA) ear painting was done 5h after antibodies administra- 
tion and ears were collected 15h later. For neutrophil depletion, mice were treated 
with 50 ug per 20g body weight of rat anti-mouse Ly6G antibody (clone 1A8, 
BioXCell) or rat IgG every second day starting from day 4 after LLC tumour 
injection and every day from day 12 after tumour injection until the end of the 
experiment. Efficiency of neutrophil depletion was assessed by FACS in blood and 
tumours. 

TPA model of acute skin inflammation. Phorbol ester TPA was used to induce 
acute skin inflammation as described previously”. Briefly, TPA (2.5 pg in 20 pl 
acetone per mouse) was topically applied on the ear skin of anaesthetized mice. 
The contralateral ear was painted with acetone alone as vehicle control. Mice were 
killed after 24h and ears were collected in 2% PFA for histological analysis. 
Zymosan-mediated acute peritonitis model. To induce acute peritonitis, zymo- 
san A (Sigma) was prepared at 2mgml ' in sterile PBS. Four hours after ip. 
injection of 0.1 mg zymosan A per mouse, inflammatory exudates were harvested 
by peritoneal lavage with 2 ml PBS. Cells were counted with a Burker chamber and 
stained for Ly6G (1A8) and F4/80 (BM8) for FACS analysis. 
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Air pouch assay. To create subcutaneous air pouches, bone marrow transplanted 
chimaeric mice or Mrp8;Mef" or Mrp8;Met"’”" mice were injected with 3 ml of 
sterile air by dorsal subcutaneous injection with a butterfly 23 G needle on day 0 
and on day 3. On day 6, 200 ng per mouse of murine CXCL1 or HGF dissolved in 
0.5 ml PBS-Heparin (15Uml ') or PBS-Heparin (15U ml ') as control, were 
injected in the newly formed dorsal camera. After 4h, inflammatory cells were 
harvested by washing the pouch with 5 ml of PBS. Cells were stained for Ly6G 
(1A8), washed and resuspended in PBS 0.1% BSA with unlabelled counting beads 
and quantified by FACS. 

Mouse white blood cell isolation. Blood was collected from the retro-orbital vein 
in 10% heparin. For the isolation of white blood cells (WBCs), the blood was 
diluted in 1.25% dextran in saline to allow the sedimentation of red blood cells 
(RBCs). After 30 min, the erythrocyte-poor upper layer was collected and washed 
in PBS 0.1% BSA. The remaining RBCs were lysed in a hypotonic solution of 0.2% 
NaCl for 30 s and brought in isotonic condition with 1.6% NaCl and 0.1% glucose. 
WECs were washed in PBS 0.1% BSA, counted and resuspended accordingly with 
the experimental setting. 

Mouse blood neutrophil isolation. Blood was collected from the retro-orbital 
vein in 10% heparin and diluted in an equal volume of PBS 0.5% BSA. Up to 5 ml of 
diluted blood was layered on top of a discontinuous gradient of Histopaque 1119 
(4 ml) and Histopaque 1077 (5 ml) from Sigma. The gradient was centrifuged for 
30 min at 700g with the brake off. The neutrophil layer between the Histopaque 
1077 and 1119 was collected and washed in PBS 0.5% BSA. RBC lysis was per- 
formed as described earlier. Neutrophils were washed in PBS 0.5% BSA, counted 
and resuspended according to the experimental condition. Alternatively, blood 
was sedimented in a saline solution containing 1.25% dextran and neutrophils 
were negatively selected with magnetic beads*’. Neutrophil purity, as assessed by 
the hemocytometer, was always higher than 93%. 

Bone marrow neutrophil and mononuclear cell isolation. To reach reasonable 
amounts of protein, all the western blot analyses in mice were performed on 
neutrophils isolated from bone marrows. Mice were killed by cervical dislocation. 
Femurs and tibias were collected in cold sterile Hank balanced salt solution (HBSS; 
Invitrogen) and flushed with HBSS 0.25% BSA. Cells were layered on top of a 
discontinuous gradient of Percoll 81%, 62%, 55%, freshly prepared and centri- 
fuged for 30 min at 2,000g with the brake off. Monocytes were collected at the 
interface between the bone marrow cells and the layer of Percoll 55%, whereas 
neutrophils were collected at the interface between Percoll 55% and 62%. Cells 
were washed in HBSS 0.25% BSA and RBC lysis was performed as described 
earlier. Neutrophils (or monocytes) were washed again, counted and resuspended 
according to the experimental setting. Neutrophil (or monocyte) purity, as 
assessed using a haemocytometer, was higher than 87%. 

FACS analysis and flow sorting of blood or tumour-associated cells. Blood was 
collected in 10% heparin, incubated for 15 min with Mouse BD Fc Block (2.4G2, 
BD Pharmingen) 1:100 and stained for 20 min at room temperature. After RBC 
lysis, cells were washed and resuspended in FACS buffer (PBS containing 2% FBS 
and 2mM EDTA). Tumours were minced in RPMI medium containing 0.1% 
collagenase type I and 0.2% dispase type I (Gibco) for 30 min at 37 °C and passed 
through a 70 and 40 um cell strainer. After RBC lysis, cells were resuspended in 
FACS buffer. Lungs were collected after 7 min of transcardial perfusion with saline 
and processed as for the tumours. Single-cell suspensions were incubated for 
15min at 4°C with Mouse BD Fc Block 1:100 in FACS buffer. The antibodies 
were added directly in the blocking solution in the appropriate combinations (as 
indicated in the figure legends). CD45 (30F-11), CD11b (M1/70), Ly6G (1A8), 
CD45R (RA3-6B2), CD3 (17A2), CD4 (RM4-5), IgE (R35-72), CD49b (DX5), 
Ly6C (AL-21) (all from BD Pharmingen), F4/80 (BM8), CD115 (AF598), 
MHCII (M5/114.15-12) (all from eBioscience), were used 1:200 for 2 X 10° in 
100 jul; Siglec-F (E50-2440, BD Pharmingen) was diluted 1:750. In all the stainings 
7AAD (BD Pharmingen) was used to gate out dead cells. For intra-tumour pro- 
liferation, 1 mg of BrdU was i.p. injected in each mouse 4h before tumour collec- 
tion and cell proliferation was quantified on single-cell suspensions with the FITC 
BrdU Flow Kit (BD Bioscience) according to the manufacturer’s protocol. Tumour 
apoptosis was assessed by staining single-cell suspensions for the apoptotic marker 
AnnexinV (1:40, BD Bioscience), excluding 7AAD-positive cells. For TAN apop- 
tosis, tumour single-cell suspensions were gated for CD11b (M1/70) and Ly6G 
(1A8); AnnexinV and 7ADD were used to distinguish apoptotic or dead neutro- 
phils. In vitro neutrophil apoptosis was performed by seeding neutrophils isolated 
from tumour-bearing mice at a concentration of 1 X 10° per ml and stimulating 
them with or without LPS (1 ug ml~ ), alone or in combination with HGF (100 ng 
ml!) for 10 hat 37 °C. Cells were collected, washed and stained for AnnexinV and 
7AAD. The combination of CD11b (M1/70), Ly6G (1A8) and MET (eBioclone 7, 
eBioscience, 1:50) was used to identify triple-positive MET-expressing neutrophils 
in blood or 7AAD-negative single-cell suspensions from tumours. Freshly stained 
samples were analysed by FACS Canto II (BD Bioscience). For tumour-associated 
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neutrophil sorting, the myeloid population was enriched by coating with CD11b- 
conjugated magnetic beads (MACS, Miltenyi Biotec), followed by separation 
through magnetic columns (MACS, Miltenyi Biotec), staining with Ly6G and 
sorting with a FACS Aria I (BD Bioscience). Cells were collected in RLT buffer 
(Qiagen) for RNA extraction or resuspended according to the experimental con- 
ditions. 

Human neutrophil isolation. Ten millilitres of venous blood from healthy volun- 
teers were collected in citrate-coated tubes and isolated by erythrocyte sedimenta- 
tion with dextran and purification with a discontinuous plasma-Percoll gradient as 
already described™. 

Lung cancer patients. We enrolled four non-small-cell lung carcinoma patients 
with no previous history of oncological, chronic inflammatory, or autoimmune 
diseases within 10 years before this study. This patient list includes 2 males (61 and 
71 years of age) and 2 females (64 and 68 years of age), of which 3 were smokers 
(1 T1NOMO adenocarcinoma, 1 T2NOMO large-cell carcinoma, 1 T3NOMO 
squamous-cell carcinoma) and 1 was a non-smoker (T2NOM0 adenocarcinoma). 
The protocol was approved by the Ethics Committee of the University Hospitals 
Gasthuisberg (Leuven), and all subjects consented before study participation. 
Flow sorting of neutrophils from lung cancer patients. Lung tumour biopsies or 
healthy tissues were minced in RPMI medium containing 0.1% collagenase type I, 
0.2% dispase type I and DNase I 100 U ml! (60 min at 37 °C), passed through a 
19 G needle and passed through a 70 and 40 1m cell strainer. After RBC lysis, cells 
were resuspended in FACS buffer (PBS containing 2% FBS and 2mM EDTA) and 
counted. The myeloid population, enriched using CD11b-conjugated magnetic 
beads (MACS, Miltenyi Biotec) and separated through a magnetic column 
(MACS, Miltenyi Biotec), was stained with anti-CD66b (G10F5, BD Pharmingen, 
1:100) for 20 min at 4°C and sorted with FACS Aria I (BD Bioscience). Cells were 
counted and resuspended in RLT buffer (Qiagen) for RNA extraction. 
Endothelial cell isolation. Lungs were collected and a single-cell suspension was 
obtained as described earlier. Endothelial cells were obtained by performing a 
negative selection for CD45 (30F-11) and F4/80 (CI:A3-1) followed by a positive 
selection for CD31 (MEC 13.3) by using magnetic beads (Dynabeads, Invitrogen) 
according to the manufacturer’s protocol. 

Peritoneal macrophages. Five millilitres of sterile PBS were injected in the peri- 
toneum of anaesthetized mice and collected after 3 min. Cells were centrifuged, 
washed and cultured overnight. 

TCM and LLC (or A549) CCM preparation. Two grams of end-stage LLC 
tumour explanted from WT mice were chopped and incubated at 37°C in 7 ml 
of DMEM (supplemented with 2 mM glutamine, 100 units ml” ' penicillin/100 pg 
ml! streptomycin) FBS-free (DMEM 0% FBS). 5 X 10* LLC (or A549) were 
seeded in a 6-multiwell plate in DMEM 10% FBS and incubated at 37°C. 
Medium alone (DMEM 0% FBS or DMEM 10% FBS, respectively) was used to 
prepare mock controls. After 72h, the medium was filtered, supplemented with 
2mM glutamine and 20mM HEPES and stored at —20°C. TCM and mock 
medium (DMEM 0% FBS) were diluted 1:5 in DMEM 10% FBS; CCM and mock 
medium (DMEM 10% FBS) were diluted 4:5 in DMEM FBS-free. 

qRT-PCR. For mRNA analysis, 1 X 10° or 3 X 10° mouse or human blood neu- 
trophils, respectively, were incubated in normoxic (21% oxygen) or hypoxic (1% 
oxygen) conditions, or stimulated with TCM (plus 50 pgml~' Enbrel or human 
IgG where indicated), CCM, A549-CCM, 100 ng ml ~ of murine or human TNF- 
a, 50ng ml! LPS, or mock medium in a 96-multiwell plate for 4h at 37°C. For 
NF-«B inhibition, 0.18 X 10° neutrophils were pre-treated with 10 1M 6-amino- 
4-(4-phenoxyphenylethylamino) quinazoline (Calbiochem) for 1h at 37°C and 
stimulated with 100 ng ml’ of murine TNF-« for 1h at 37°C. 2 X 10° HUVECs 
were seeded in a 24-multiwell plate coated with 0.1% gelatin and stimulated with 
5ng ml~? IL-l in DMEM 10% EBS for 4h at 37 °C. Cells were washed in PBS, 
collected in RLT buffer (Qiagen) and kept at —80 °C. RNA was extracted with the 
RNeasy Micro kit (Qiagen) according to the manufacturer’s instructions. Reverse 
transcription to CDNA was performed with the SuperScript III First Strand cDNA 
Synthesis Kit (Life Technologies) according to the manufacturer’s protocol. Pre- 
made assays were purchased from Applied Biosystem, except for Nos2 that was 
provided by IDT. cDNA, primer/probe mix and TaqMan Fast Universal PCR 
Master Mix were prepared in a volume of 10 11 according to manufacturer’s 
instructions (Applied Biosystems). Samples were loaded into an optical 96-well 
Fast Thermal Cycling plate (Applied Biosystems) and qRT-PCR were performed 
using an ABI Prism 7500 Fast Real-Time PCR System (Applied Biosystems) 
ELISA. To quantify plasma, intra-tumoural, tumour-released (TCM) and neu- 
trophil-released HGF, a murine HGF ELISA kit (R&D) was used according to 
manufacturer’s protocols. Blood was collected from tumour-free or tumour-bear- 
ing mice and plasma was prepared according to manufacturer’s instruction. TCM 
was prepared as described earlier. Tumour proteins were extracted in Extraction 
Buffer (20M Tris HCl, 150mM NaCl, 1% Triton X-100, 10% glycerol, 5mM 
EDTA). 0.2 X 10° neutrophils were cultured for 20h in DMEM complete in 
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presence or absence of 100 ng ml’ of murine TNF-«. Medium was collected, spun 
down and supernatant stored at —80°C until use. Mock medium was used as 
negative control. For phospho-MET quantification, 6 X 10° mouse blood neutro- 
phils isolated from tumour-bearing mice were cultured in the presence or absence 
of 100 ng ml! of murine TNF-o; 11h later, cells were stimulated or not with 
100ng ml‘ of mouse HGF in the presence of 1 mM orthovanadate for 3 min at 
37 °C, washed in PBS supplemented with 1 mM orthovanadate and 1X phosStop 
(Roche) and lysed in Extraction Buffer supplemented with 1 mM orthovanadate, 
2X phosStop and Complete Mini protease inhibitor cocktail (Roche) for 20 min at 
4°C. After clearance, samples were quantified and the same amount of proteins 
was used for MET and phospho-MET detection using a sandwich ELISA. Briefly, 
96-well microtitre plates (MaxiSorp, Nunc) were coated with 1 pg ml of anti- 
mouse MET antibody (AF527, R&D Systems) overnight at 4°C and then incu- 
bated for 2h at room temperature in blocking buffer (PBS, 0.1% Tween-20, 6% 
non-fat dry milk). The same amount of proteins per sample was diluted in block- 
ing buffer and incubated for 2 h at room temperature on the ELISA plate. After 6 
washes in PBS 0.1% Tween-20, samples were incubated for 2h at room temper- 
ature with the mouse anti- MET (3D4, Invitrogen) or the mouse anti-phosphotyr- 
osine (4G10, Merck Millipore) antibodies diluted 1:500 in blocking buffer, then 
washed 6 times in PBS 0.1% Tween-20, and incubated with goat anti-mouse 
immunoglobulins conjugated to horseradish peroxidase (sc-2031, Santa Cruz 
Biotechnology) diluted 1:500 in blocking buffer for 2h at room temperature. 
Signals were developed by 15 min incubation with the 3,3’,5,5'-tetramethylbenzi- 
dine (TMB) substrate solution (Promega). After stopping the reaction with H,SO,, 
absorbance was measured at 450 nm and corrected for 630 nm with a spectropho- 
tometer. 

Western blot. To assess MET deletion, bone marrow cells, neutrophils, peritoneal 
macrophages, monocytes (all cultured overnight in TCM) or endothelial cells were 
lysed in hot Laemmli buffer (2.5% SDS, 25% Tris-HCl pH 6.8) for 10 min at 96 °C, 
sonicated, cleared and quantified. Alternatively, 2 x 10° bone-marrow-derived 
neutrophils from WT mice were stimulated with TCM, CCM, 100ng ml! of 
murine TNF-« (or mock medium 0% FBS or 10% FBS as control) for 20h at 
37°C. For the co-culture with HUVECs, a monolayer of HUVECs was stimulated 
for 4h with 5 ng ml”! IL-1o. at 37 °C, and washed before neutrophil seeding. After 
20h of stimulation, neutrophils were collected using Cell Dissociation Buffer 
Enzyme Free PBS-Based (Gibco). Cells were washed in PBS, lysed in 15 pl of a 
protease inhibitor mixture and incubated for 15 min on ice. The protease inhibitor 
mixture was obtained by dissolving one tablet of Complete Mini protease inhibitor 
mixture (Roche) in 5 ml of PBS with 2mM diisopropyl fluorophosphate (DFP; 
Acros Organics). After addition of an equal amount of 2X SDS sample buffer 
supplemented with 4% 2-mercaptoethanol, the lysates were boiled for 15 min and 
kept at —80°C until use. NF-«B inhibition was achieved by pre-treating 7 x 10° 
neutrophils with 101M 6-amino-4-(4-phenoxyphenylethylamino) quinazoline 
(Calbiochem) for 1h at 37°C; cells were then stimulated with murine TNF-« 
(100ngml~') for 5h before lysis. Human MET was assessed by stimulating 
3 X 10’ blood neutrophils isolated from the blood of healthy volunteers with 
A549-CCM, 100ngml~* human TNF-«, 50ngml~' LPS (or mock medium 
10% FBS as control) for 20h. Cells were incubated with 2.7 mM DFP for 15 min 
at 4°C, collected and washed in PBS supplemented with 2.7mM DFP and 
Complete Mini protease inhibitor 1X, and lysed in hot Laemmli buffer at 96 °C 
for 10 min. Cell lysates were sonicated, cleared and quantified. 6 loading buffer 
was added before loading on the gel. The following primary antibodies were used: 
mouse anti-mouse Met (3D4, Invitrogen), mouse anti-mouse B-actin (I-19, Santa 
Cruz), mouse anti-vinculin (hVIN-1, Sigma), rabbit anti-human MET (D1C2, Cell 
Signaling), horseradish peroxidase (HRP)-conjugated anti-f-tubulin (Abcam). 
The following secondary antibodies were used: HRP-conjugated goat anti-mouse 
and HRP-conjugated goat anti-rabbit (Santa Cruz). Signal was visualized by 
Enhanced Chemiluminescent Reagents (ECL; Invitrogen) or West Femto by 
Thermo Scientific according to the manufacturer’s instructions. 

Adhesion assay. 4 X 10* HUVECs were seeded in M199 complete in a 96-multiwell 
plate, previously coated with 0.1% gelatin. After 12 h, HUVECs were stimulated 
with 5ngml~’ IL-1o in DMEM 10% FBS at 37°C. After 4h the endothelial 
monolayer was gently washed and 2.5 X 10° WBCs were isolated indistinctly 
from Tie2;Met™™ and Tie2;Met™ mice or from WT->WT and KO->WT trans- 
planted mice, were seeded on top of it, with or without murine HGF (50 ng ml — 1), 
After 15 min non-adherent cells were washed out whereas adherent cells were 
detached by using a Cell Dissociation Buffer Enzyme Free PBS-Based (Gibco). 
Cells were stained for Ly6G (clone 1A8), washed and resuspended in PBS-BSA 
0.1% with unlabelled counting beads (BD Bioscience) and quantified by FACS 
Canto II (BD Bioscience). 

Transmigration and migration assay. For the transmigration assay, 2 X 10° 
HUVECs were seeded on 3 jum polycarbonate membrane (Transwell; Costar) prev- 
iously coated with 0.1% gelatin in M199 complete. After 12h, HUVECs were 


stimulated for 4h at 37°C in DMEM 10% FBS with 5ngml~' IL-1o and then 
washed. 5 X 10° WBCs isolated indistinctly from Tie2:Met”™ and Tie2:;Met™ mice 
or from WT—WT and KO—WT transplanted mice were seeded on top of the 
endothelial monolayer, while mock medium, TCM (with or without 3 pg ml! 
anti-HGF antibody AF-2207; R&D)'**! or 50 ng ml! murine HGE was added in 
the bottom. After 2h at 37 °C, transmigrated cells were collected from the bottom 
chambers and from the lower side of the filter with cold PBS 0.5% EDTA. Cells were 
stained and Ly6G* cells were quantified as described earlier. In the migration assays 
WBCs were seeded directly on top of 3 jum polycarbonate porous membranes. 
Nitric oxide measurement by FACS. Neutrophils isolated from the blood of WT 
or KO LLC-tumour bearing mice were co-cultured for 4h with LLC shMet, 
washed and resuspended in 20 mM HEPES-PBS, and incubated for 10 min with 
5 uM DAF-FM diacetate (Molecular Probes) in the absence or presence of mouse 
HGF (100 ng ml‘) at 37 °C. Cells were then washed and analysed by FACS. 
Cytotoxicity assay. LLC shMet or T241 shMet were transduced with a luciferase- 
expressing lentiviral vector (EX-hLUC-Lv114 from GeneCopoeia); 10* cells were 
then seeded in DMEM 10% EBS in a 96-multiwell plate. After 4h, 0.2 X 10° neu- 
trophils purified from the blood of tumour-bearing mice or directly from the 
tumours themselves, were co-cultured with the cancer cells in DMEM 2% FBS 
for 4h at 37°C, with or without 100 ng ml! mouse HGF or 1mM L-NMMA 
(Sigma). After washing, adherent cells were lysed in 0.2% Triton, 1 mM dithio- 
threitol (DTT). Luciferase signal was revealed with a microplate luminometer. The 
use of shMet was thought to prevent any possible confounding activity of MET on 
cancer cell survival and thus to restrict the effect of HGF to neutrophils only. 
Histology and immunostainings. To obtain serial 7-j1m-thick sections, tissue 
samples were immediately frozen in OCT compound or fixed in 2% PFA overnight 
at 4 °C, dehydrated and embedded in paraffin. Paraffin slides were first rehydrated 
to proceed further with antigen retrieval in citrate solution (DAKO). Cryosections 
were thawed in water and fixed in 100% methanol. If necessary, 0.3% HO. was 
added to methanol to block endogenous peroxidases. The sections were blocked 
with the appropriate serum (DAKO) and incubated overnight with the following 
antibodies: rat anti-CD45 (30F-11, BD Pharmingen) 1:100, rat anti-Ly6G (1A8, 
BD Pharmingen) 1:100, rat anti-CD31 (MEC 13.3, BD Pharmingen) 1:200, rabbit 
anti-FITC (Serotec) 1:200, goat anti-phosphohistone H3 (Cell Signaling) 1:100, rat 
anti-F4/80 (CI:A3-1, Serotec) 1:100, mouse anti-NK1.1-biotin (PK136, BD 
Pharmingen) 1:200, rat anti-CD45R (RA3-6B2, BD Pharmingen) 1:100, rat 
anti-CD4 (H129.9, BD Pharmingen) 1:100, rat anti-CD8 (53-6.72, BioXCell) 
1:100, hamster anti-CD11c biotin (N418, eBioscience) 1:100, mouse anti-3-nitro- 
tyrosin (HM.11 Santa Cruz) 1:200. Appropriate secondary antibodies were used: 
Alexa488-or Alexa568-conjugated secondary antibodies (Molecular Probes) 
1:200, HRP-labelled antibodies (DAKO) 1:100, Biotin-labelled antibodies 
(Jackson ImmunoResearch) 1:100. When necessary, tyramide signalling amp- 
lification (Perkin Elmer, Life Sciences) was performed according to the manufac- 
turer’s instructions. Whenever sections were stained in fluorescence, ProLong 
Gold mounting medium with DAPI (Invitrogen) was used. Otherwise, 3,3’-dia- 
minobenzidine was used as a detection method followed by Harris’ haematoxilin 
counterstaining, dehydration and mounting with DPX. Apoptotic cells were 
detected by the TUNEL method, using the AptoTag peroxidase in situ apoptosis 
detection kit (Millipore) according to the manufacturer’s instructions. For the 
double staining TUNEL and Ly6G, TUNEL staining was performed as described 
earlier, followed by Ly6G staining by using the Vectastain ABC kit (Vector 
Laboratories) according to the manufacturer’s instructions. Tumour necrosis 
and lung metastasis were evaluated by H&E staining. Necrotic area was defined 
as the area including necrotic cancer cells, inflammatory cells and stromal cells, 
compared to the total area of the field. Necrotic cells display a more glassy homo- 
geneous appearance in the cytoplasm with increased eosinophilia, while the nuc- 
lear changes are reflected by karyolysis, pyknosis and karyorrhexis. Alternatively, 
the necrotic tissue was visualized by autofluorescence as previously described”. 
Microscopic analysis was done with an Olympus BX41 microscope and Cell Sense 
imaging software or a Zeiss Axioplan microscope with KS300 image analysis 
software. The morphometric analysis was performed by acquiring 4-6 fields per 
section on five independent sections (at a distance of 401m in depth during 
sectioning) from the same biological tissue sample. The values in the graphs 
represent the average of the means of at least five samples and the standard error 
indicates the variability among the different samples. 

Hypoxia assessment and tumour perfusion. Tumour hypoxia was detected by 
injection of 60mgkg~' pimonidazole into tumour-bearing mice 1h before 
tumour harvesting. To detect the formation of pimonidazole adducts, tumour 
cryosections were immunostained with Hypoxyprobe-1-Mab1 (Hypoxyprobe 
kit, Chemicon) following the manufacturer’s instructions. Perfused tumour vessels 
were counted on tumour cryosections from mice injected intravenously with 
0.05 mg fluorescein isothiocyanate (FITC)-conjugated lectin (Lycopersicon escu- 
lentum; Vector Laboratories). 
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Tumour-derived nitric oxide production. LLC tumours were collected 8 days 
after injection, cut in pieces of about 5 X 5 mm, weighted and incubated at 37 °C in 
a 24-multiwell plate with 800 pl of DMEM. After 24h, the media was collected, 
centrifuged to remove cell debris, and NO levels were measured using the Griess 
reagent system kit (Promega). 

Statistics. Data entry and all analyses were performed in a blinded fashion. All 
statistical analyses were performed using GraphPad Prism software. Statistical 
significance was calculated by two-tailed unpaired t-test on two experimental 
conditions or two-way analysis of variance (ANOVA) when repeated measures 
were compared, with P < 0.05 considered statistically significant. Data were tested 
for normality using the D’Agostino-Pearson omnibus test (for n > 8) or the 
Kolmogorov-Smirnov test (for 1 = 8) and variation within each experimental 
group was assessed. Detection of mathematical outliers was performed using the 
Grubbs’ test in GraphPad. Sample sizes for all experiments were chosen based on 
previous experiences. Independent experiments were pooled and analysed 
together whenever possible as detailed in figure legends. All graphs show mean 
values + s.e.m. 
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Extended Data Figure 1 | Scheme illustrating the role of MET in 
neutrophils. During cancer or infections, the release of cytokines such as IL-1 
at the inflammatory site will promote the expression of TNF-« on the 
endothelium and the surrounding tissue. When circulating neutrophils 
encounter the activated endothelium, TNF-« will unleash NF-«B through 
binding to TNFR1, which in turn will induce MET expression on the neutrophil 
surface. HGF, also released and proteolytically activated at the site of 


(vs. cancer cells or pathogens) 


inflammation, will bind to MET and stimulate the firm adhesion of neutrophils 
to the endothelium, probably via integrin engagement, and thus neutrophil 
diapedesis. Once extravasated, the HGF/MET pathway will still function on 
neutrophils by reinforcing their cytotoxic response through the induction 

of iNOS and NO production, ultimately favouring a bactericidal and 
tumoricidal neutrophil phenotype. 
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Extended Data Figure 2 | Met deletion in immune cells, but not in 
endothelial cells, fosters tumour growth. a, MET expression in total bone 
marrow (BM) cells, endothelial cells (EC) and neutrophils harvested from Met 
floxed mice intercrossed with the Tie2:Cre deleter thus generating Tie2;Met™" 
(KO) or Tie2;Met’””' (WT) mice. Western blots are representative of three 
repetitions on independent biological replicates. Western blot images have 
been cropped for presentation. Full scan images are shown in Supplementary 
Fig. 1. b-d, Quantification (b) and representative images of tumour sections’ 
TdT-mediated dUTP nick end labelling (TUNEL) stainings (c, d) from 
subcutaneous end-stage LLC tumours in WIT-—>WT and KO-WT mice. Data 
combine two independent experiments; total n = 10 mice per condition. 

e, FACS quantification of AnnexinV* 7AAD~ early apoptotic tumour cells in 
WT-WT and KO-WT mice. Data combine two independent experiments; 
total n = 8 mice per condition. f-j, Tumour necrosis quantification in 
WT-WT and KO—WT mice (f), assessed by histological evaluation of H&E- 
stained tumour sections (g, h) and by measurement of autofluorescent tumour 
areas (i, j); yellow dotted lines demarcate necrosis. Data combine two 
independent experiments; total n = 10 mice per condition. 

k-m, Quantification (k) and representative images of tumour sections stained 


for the proliferation marker pHH3 (1, m) from subcutaneous end-stage LLC 
tumours in WI—WT and KO-WT mice. Data combine two independent 
experiments; total n = 10 mice per condition. n, FACS quantification of BrdU~ 
proliferating tumour cells in WI—WT and KO>WT mice. Data combine 
two independent experiments; total n = 10 mice per condition. o-r, CD31* 
vessel area (0), vessel density (p), lectin perfusion (q) and hypoxic (Pimo*) 
area (r) in LLC subcutaneous tumours from KO->WT mice (in which the 
haematopoietic/immune system is knocked out for Met) or WI—>KO mice 
(in which endothelial cells only are knocked out for Met) compared to 
control WI-WT mice. Data in o-r combine two independent 

experiments; total n: WI—>WT, 12; KO->WT, 8; WI->KO, 8. 

s-u, Subcutaneous LLC tumour growth (s), weight (t) and lung 

metastases (u) in Tie2;Met" compared to Tie2;Met””™ mice. Data combine 
two independent experiments; total n: Tie2;Met”’”, 12: 
Tie2;Met™, 10. v, Subcutaneous LLC tumour growth in endothelial-cell- 
specific Met KO (WT-—KO) and control (WT-—>WT) mice. Data combine two 
independent experiments; total n = 8 per condition. *P < 0.05 versus 

WT WT (b, e, £, k, n), versus Tie2;Met””” (s—u). Scale bars: 50 pm 

(c, d, 1, m); 100 ttm (g-j). All graphs show mean = s.e.m. 
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Extended Data Figure 3 | Circulating and tumour-infiltrating immune cells 
upon Met deletion. a—e, FACS analysis showing percentages of 

circulating monocytes (a), lymphocytes (b), neutrophils (c), eosinophils 

(d) and basophils (e) in tumour-free or in LLC-tumour-bearing WI>WT 
and KO>WT mice. Data combine two independent experiments; total n = 8 
mice per condition. f, Quantification of LLC tumour sections stained for the 
pan-leukocyte marker CD45, the macrophage marker F4/80, the NK marker 
NK1.1, the B lymphocyte marker CD45R, the T helper cell marker CD4, the 
cytotoxic T cell marker CD8 and the dendritic cell marker CD11c (with 
exclusion of F4/80* area) in WI->WT and KO->WT mice. Data combine two 
independent experiments; total n = 8 mice per condition). g, h, FACS 
quantification for tumour-associated CD45" leukocytes (g) or CD45* IgE* 
CD49b* CD4~ CD45R™ basophils and CD45* CD11b* SiglecE* Ly6C™*4 F4/ 
80'°Y MHCII” eosinophils (h) in WI>WT and KO>WT mice. 

Data combine two independent experiments; total n = 8 mice 

per condition. i, j, FACS quantification (i) and gating strategy (j) for 
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tumour-associated neutrophils selected from the main tumour cell population 
negative for 7AAD staining; tumour-associated neutrophils were then gated as 
CD11b and Ly6G double-positive cells. Data combine two independent 
experiments; total n; WI—WT, 11; KO>WT, 10. k, Ly6G* tumour 
infiltration at day 9, day 13 and day 19 after LLC subcutaneous tumour 
injection in WI—WT and KO>WT mice. Data combine two independent 
experiments; total n = 8 mice per condition. 1, Morphometric quantification of 
leukocytes and macrophages on CD45- and F480-stained lung sections, 
respectively, from LLC-tumour-bearing WIT—-WT or KO-WT mice. Data 
combine two independent experiments; total n = 8 mice per condition. 

m, FACS quantification of CD11b* Ly6G* neutrophils and CD11b™ F4/80* 
macrophages infiltrating metastatic lungs from LLC-tumour-bearing 
WT-WT or KO>WT mice. Data combine two independent experiments; 
total n = 8 mice per condition. *P < 0.05 versus WT—>WT 

(f, g, i, k-m); }P < 0.05 versus tumour free (a-d). All graphs show 

mean + s.e.m. 
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Extended Data Figure 4 | MET in neutrophils is required for their anti- 
tumour activity. a, b, Western blot analysis (a) and relative densitometric 
analysis (b) for MET expression in bone marrow neutrophils and monocytes 
upon reconstitution of WT recipient mice by WT or KO HSPCs transduced in 
vitro with an empty vector (Mrp8:Empty) or a vector expressing Met under the 
neutrophil-specific promoter Mrp8 (Mrp8:Met); tubulin was used as loading 
control. Western blots are representative of three repetitions on independent 
biological samples where each sample is the pool of neutrophils or monocytes 
isolated from three mice. Densitometric analysis was performed on these three 
western blots. A.U., arbitrary units. c, FACS analysis for green fluorescent 
protein (GFP) in circulating Ly6G* neutrophils or CD115* monocytes, 
harvested from the neutrophil-specific Mrp8:Cre line carrying separate 
expression of GFP because of an internal ribosome entry site ([RES) 


LETTER 


downstream the Mrp8-driven cre gene. Data combine two independent 
experiments; total n = 10 mice per condition. d, MET expression in 
neutrophils, monocytes and macrophages harvested from Mrp8;Met"”™ or 
Mrp8;Met™ mice. Western blots are representative of three repetitions on 
independent biological replicates. e, FACS analysis for CD11b" Ly6G* 
neutrophils in subcutaneous LLC tumours from Mrp8;Met"”™* or 
Mrp8;Met™", Data combine two independent experiments; total 1: 
Mrp8;Met"”” wt 10; Mrp8;Me¢" Vf! 11. Western blot images in a, d have been 
cropped for presentation. Full scan images are shown in Supplementary Fig. 1. 
*P <0.05 versus Mrp8:Empty WT->WT (b), versus Mrp8;Met"””* 

(e); TP < 0.05 versus Mrp8:Empty WI—>WT; {P < 0.05 versus Mrp8:Empty 
KO-—WT. All graphs show mean = s.e.m. 
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Extended Data Figure 5 | Pharmacological and genetic inhibition of MET 
prevents the recruitment of anti-tumoural neutrophils to several neoplastic 
tissues and inflammatory sites. a, Tumour weight of subcutaneous B16F10 
melanomas in WI—WT and KO>WT mice. Data combine two independent 
experiments; total n: WI—WT, 8; KO>WT, 9. b, c, Total tumour weight 
(b) and metastatic index (c) in MMTV-PyMT mice reconstituted with WT or 
Met KO bone marrow cells before tumour appearance (WT—PyMT and 
KO-—PyMT mice, respectively). Data combine three independent experiments; 
total n: WIT—PyMT, 13; KO—PyMT, 16. d, e, FACS quantification for 
CD11b* Ly6G* neutrophils in T241 tumours harvested from WT->WT or 
KO—WT mice (d) or in in breast tumours spontaneously grown in 
WT->PyMT and KO-»PyMT mice (e). Data combine two independent 
experiments; total n = 10 mice per condition (d) or total n = 8 mice per 
condition (e). f-i, Length measurement (f) and representative image (g) of the 
colon, as well as quantification of neutrophils (h) and macrophages (i) on bowel 
sections, from WT->WT and KO->WT mice upon induction of chronic colitis 
compared to healthy control. Data combine two independent experiments; 
total n: healthy, 5; WI—WT, 12; KO->WT, 15. j, k, Tumour weight (j) and 
metastatic mesenteric lymph nodes (k) 12 days after orthotopic injection of 
pancreatic Panc02 cancer cells in WI—WT and KO>WT mice. Data combine 
two independent experiments; total n = 12 per condition. 1, Histological 
quantification of Ly6G* infiltrates in Panc02 pancreatic tumours harvested 
from WT-WT and KO-WT mice. Data combine two independent 
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experiments; total n = 12 mice per condition. m, Quantification of plasma HGF 
in tumour (TM)-free mice, in subcutaneous LLC or orthotopic Panc02 tumour- 
bearing mice. Data combine two independent experiments; total n: tumour 
free, 10; LLC, 10; Panc02, 8 biological replicates. n, Quantification of HGF in 
subcutaneous LLC or orthotopic Panc02 tumours. Data combine two 
independent experiments; total n: LLC, 10; Panc02, 8 biological replicates. 

0, p, Quantification of HGF in plasma (0) or in subcutaneous LLC tumours 
(p) from tumour-bearing WI—WT and KO>WT mice. Data are 
representative of two independent experiments using 5 mice per condition per 
experiment. q, Quantification of Ly6G* area on sections from B16F10 
melanomas grown in C57BL/6 WT mice, daily treated with PF-04217903, 
INCB28060, JNJ-38877605, or vehicle as control. Data combine two 
independent experiments; total n: vehicle, 14; PF-04217903, 9; INCB28060, 6; 
JNJ-38877605, 4. r, Western blot analysis for MET in B16F10 melanoma cells 
after transduction with a lentiviral vector encoding scrambled or mouse shMet 
under a constitutive promoter; vinculin was used as loading control. Western 
blot is representative of three independent repetitions. Western blot images 
have been cropped for presentation. Full scan images are shown in 
Supplementary Fig. 1.*P < 0.05 versus WI>WT (a, d, h), versus WI—>PyMT 
(b, e), versus LLC (m, n), versus vehicle (q); }P < 0.05 versus healthy 

(f, h, i), versus tumour free (m). Scale bar: 10 mm (g). All graphs show 

mean = s.e.m. 
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Extended Data Figure 6 | HGF is required for MET activation upon 
induction by TNF-a. a, Gating strategy related to Fig. 3b to quantify MET 
expression in blood neutrophils from LLC-tumour (TM)-bearing mice and in 
TANs, where live cells were first gated as CD11b-positive cells; this population 
was finally gated for Ly6G and MET to identify MET-expressing Ly6G* 
neutrophils. b, c, qRT-PCR for MET in mouse (b) and human (c) neutrophils 
after LPS or TNF-« stimulation. Data are representative of three independent 
experiments using four biological replicates per condition per experiment. 

d, e, (RT-PCR for MET expression in mouse (d) or human (e) neutrophils 
cultured in normoxia (21% O ) or hypoxia (1% O2). Data combine two 
independent experiments; total n = 8 biological replicates per condition. 

f, g, ELISA for total MET (f) and phospho-MET (g) from mouse neutrophils 
stimulated for 3 min with mock medium or HGF after an overnight incubation 
with or without TNF-«. Data combine three independent experiments; total 
n = 6 biological replicates per condition. h, HGF release by neutrophils 
stimulated with mock medium or TNF-o after 20h in culture. Data combine 
two independent experiments; total n = 6 biological replicates per condition. 
i, RT-PCR for TNFA in HUVECs upon stimulation with IL-1 compared to 
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mock medium. Data combine two independent experiments; total n = 4 
biological replicates per condition. j, (RT-PCR for Met in mouse neutrophils 
co-cultured with HUVEC/NS or HUVEC/IL transduced with shTNFA or 
scramble as control. Data are representative of three independent experiments 
in which three different shRNA sequences were used; total n = 4 biological 
replicates per condition per experiment. k, l, RT-PCR for Met in WT, TNFR1 
KO or TNRF2 KO neutrophils upon co-culture with HUVEC/NS or HUVEC/ 
IL (hk), or after stimulation with conditioned medium (TCM) from LLC 
tumours (1). Data are representative of two independent experiments using four 
biological replicates per condition per experiment. m, qRT-PCR for MET in 
human neutrophils stimulated with A549-CCM in the presence or absence of 
Enbrel or human IgG as control. Data are representative of two independent 
experiments using four biological replicates per condition per experiment. 

*P < 0.05 versus mock (b, ¢, i), versus TNF-o alone (g), versus HUVEC/NS 
(j), versus WT (k, 1), versus A549-CCM (m); +P < 0.05 versus untreated or 
HGEF alone (f, g), versus HUVEC/NS (k), versus mock (1, m). Graph shows 


mean = s.e.m. 
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Extended Data Figure 7 | Met deletion in neutrophils does not affect 
apoptosis. a, b, Gating strategy of apoptotic WT (a) and Met KO 

(b) neutrophils in LLC tumours where single-cell suspensions were first gated 
for physical parameters and then for CD11b and Ly6G to identify neutrophils 
as double-positive cells; this population was finally gated for AnnexinV and 
7AAD: AnnexinV* 7AAD° cells display early apoptotic neutrophils whereas 
AnnexinV* 7AAD* cells display late apoptotic neutrophils. c, Quantification 
of apoptotic WT and Met KO tumour-associated neutrophils measured by 


FACS. Data combine two independent experiments; total n = 7 mice per 
condition. d, Quantification of apoptotic WT and Met KO neutrophils on LLC 
tumour sections by immunohistochemistry. Data combine two independent 
experiments; total n: WI>WT, 7; KO-WT, 6. e, FACS analysis for AnnexinV 
and 7AAD of WT or KO neutrophils incubated for 10h in the presence or 
absence of LPS and HGF, alone or in combination. Data combine two 
independent experiments; total n = 6 biological replicates per condition. 

+P <0.05 versus untreated or HGF alone. Graph shows mean + s.e.m. 
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Extended Data Figure 8 | MET affects neither neutrophil basal migration 
nor polarization but it is required for neutrophil recruitment and 
cytotoxicity. a, Quantification of Ly6G staining in ear sections upon phorbol 
ester (TPA)-induced cutaneous rash in Mrp8;Met"” wl and Mrp&iMef'? mice. 
Data combine two independent experiments; total n = 8 mice per condition. 
b, FACS analysis on peritoneal lavages for Ly6G* neutrophils or F4/80* 
macrophages in Mrp8;Met’””’ and Mrp8;Met™" mice 4h after 
intraperitoneal injection of sterile zymosan A. Data are representative of two 
independent experiments using 5 mice per condition per experiment. 

c, d, Quantification of F4/80 (c) and CD3 (d) stainings in ear sections at 
baseline and upon TPA-induced cutaneous rash. Data combine two 
independent experiments; total n: WI—WT control (CTRL), 22; KO—-WT 
CTRL, 15; WT-WT TPA, 23; KO-WT TPA, 15 (c); or total n = 8 mice per 
condition (d). e, FACS quantification of Mrp8;Met"”™* and Mrp8;Met" M 
neutrophils recruited into subcutaneous air pouches in response to HGF, 
CXCLI1 or PBS. Data combine two independent experiments; total n = 6 
mice per condition. f, FACS quantification of WT neutrophil adhesion to 
quiescent HUVECs (HUVEC/NS) or activated HUVECs (HUVEC/IL) in the 
presence or absence of HGF. Data are representative of two independent 
experiments using four biological replicates per condition per experiment. 
g, h, FACS quantification of WT and Met KO neutrophils migrated through 
a bare porous filter (that is, in the absence of HUVECs) towards HGF (g) or 
tumour conditioned medium (TCM) (h). Data are representative of two 
independent experiments using three biological replicates per condition per 


experiment. i, Gene expression profile for N1 and N2 markers in 
neutrophils sorted from LLC tumours grown in WI—WT or KO>WT 
mice. Data are representative of three independent experiments using 4 
mice per condition per experiment. j, Cytotoxicity of WT and KO 
tumour-associated neutrophils against T241 cells in the absence or 
presence of the NO synthase inhibitor L-NMMA. Data are representative 
of three independent experiments using three biological replicates per 
condition per experiment. k, FACS quantification of DAF-FM-positive 
circulating neutrophils after co-culture with LLC cancer cells as a readout 
of NO production in the absence or presence of HGF. Data are 
representative of four independent experiments using three biological 
replicates per condition per experiment. 1, Quantification of LLC cancer 
cell killing by WT and KO neutrophils (isolated from the blood of 
tumour-bearing mice), stimulated with HGF alone or in the presence of 
L-NMMA. Data are representative of two independent experiments using 
n= 12 biological replicates per condition per experiment. m, Blood 
neutrophils in WI—WT and KO-WT mice treated with neutrophil- 
depleting Ly6G antibody or rat IgG as control. Data combine two 
independent experiments; total n= 16 per condition. *P<0.05 versus 
Mrp8;Met'” “ (a, b), versus Mrp8;Met"” “! + HGF (e), versus HUVEC/ 
NS (f), versus WI->WT untreated (j), versus WI—WT + HGF 

(kl); +P <0.05 versus CTRL (c, d), versus PBS (e), versus mock 

(f, h), versus WIT—WT untreated (j-l), versus IgG (m); {P< 0.05 versus 
WT—WT + HGF (I). All graphs show mean + s.e.m. 
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Extended Data Table 1 | Blood count in Tie2;Met”™™ or Tie2;Met™" tumour-free mice 


Tumour free Tie2;Mer”™™ Tie2;Mer" 
WBC (k/ul) 5,681.44 5.55+1.29 
NEU (%) 23.03+5.45 29.67£7.88 
LYM (%) 69.7246.46 72.0344.89 
MON (%) 1.24+0.37 2.8641.15 
EOS (%) 0.12+0.05 0.1740.12 
BAS (%) 3.38+1.32 4.4742.1 
RBC (M/uL) 5.21+1.91 4.8941.52 
HCT (%) 71.3#3.43 60.2+13.38 
MCHC (g/dl) 15.83+2.65 18,340.26 
PLT (K/uL) 439.73426.64 508+55.79 


The values show the haematological parameters (mean + s.e.m.) in tumour-free Tie2;Met’™t and Tie2;Met” mice. Data combine two independent experiments; total n = 10 per condition. BAS, basophil; EOS, 
eosinophil; HCT, haematocrit; LYM, lymphocyte; MCHC, mean cell haemoglobin concentration; MON, monocyte; NEU, neutrophil; PLT, platelet; RBC, red blood cell; WBC, white blood cell. 
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Extended Data Table 2 | Blood count in WT—-WT and KO->WT tumour-free or tumour-bearing mice 


The values show the haematological parameters (mean + s.e.m.) in tumour-free or LLC-tumour-bearing (21 days) WT-+WT and KO=WT chimaeric mice. Data combine two independent experiments; total n = 10 
per condition. BAS, basophil; EOS, eosinophil; HCT, haematocrit; LYM, lymphocyte; MCHC, mean cell haemoglobin concentration; MON, monocyte; NEU, neutrophil; PLT, platelet; RBC, red blood cell; WBC, white 


blood cell. 


Tumour free WT > WT KO DWT 
WBC (k/ul) 10.03+2.05 8.66+40.93 
NEU (%) 9.18+2.1 10.3+3.07 
LYM (%) 85.2+2.91 83.944:3.46 
MON (%) 1.41+0.48 1.3+0.4 
EOS (%) 0.44+0.17 0.25+0.05 
BAS (%) 3.77+40.68 4.21+0.18 
RBC (M/pL) 8.21+0.54 9340.21 
HCT (%) 59.86+3.52 70.1641.67 
MCHC (g/dl) 18.63+0.61 13.4+0.16 
PLT (K/L) 589.26+134.65 758.4+50.63 
Tumour bearing WT > WT KO DWT 
WBC (k/pl) 7.97+0.63 9.12+1.22 
NEU (%) 44,340.37 53.71+7.23 
LYM (%) 27.29+8.33 33.962:2,52 
MON (%) 1.79+0.75 1.9+0.64 
EOS (%) 0.26+40.03 0.45+0.1 
BAS (%) 1.94+0.53 1.84+0.57 
RBC (M/pL) 5540.54 6.83+0.46 
HCT (%) 42.13+42.93 49.4343.47 
MCHC (g/dl) 18.2740.5 19.24+0.05 
PLT (K/uL) 5$66.174109.48 805.5+88.19 
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Regulation of endoplasmic reticulum turnover by 


selective autophagy 


Aliaksandr Khaminets', Theresa Heinrich?*, Muriel Mari®*, Paolo Grumati!, Antje K. Huebner’, Masato Akutsu”, 
Lutz Liebmann?, Alexandra Stolz', Sandor Nietzsche®, Nicole Koch’, Mario Mauthe?”, Istvan Katona’, Britta Qualmann’, 
Joachim Weis®, Fulvio Reggiori**, Ingo Kurth?8, Christian A. Hiibner?8 & Ivan Dikic8 


The endoplasmic reticulum (ER) is the largest intracellular endo- 
membrane system, enabling protein and lipid synthesis, ion home- 
ostasis, quality control of newly synthesized proteins and organelle 
communication’. Constant ER turnover and modulation is needed 
to meet different cellular requirements and autophagy has an 
important role in this process” *. However, its underlying regula- 
tory mechanisms remain unexplained. Here we show that members 
of the FAM134 reticulon protein family are ER-resident receptors 
that bind to autophagy modifiers LC3 and GABARAP, and facil- 
itate ER degradation by autophagy (‘ER-phagy’). Downregulation 
of FAM134B protein in human cells causes an expansion of the ER, 
while FAM134B overexpression results in ER fragmentation and 
lysosomal degradation. Mutant FAM134B proteins that cause 
sensory neuropathy in humans’ are unable to act as ER-phagy 
receptors. Consistently, disruption of Fam134b in mice causes 
expansion of the ER, inhibits ER turnover, sensitizes cells to 
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Figure 1 | FAM134B binds LC3-like modifiers and co-localizes with ER 
marker proteins. a, Overexpressed FAM134B-HA (haemagglutinin) binds 
GST-LC3-like modifiers. b, Co-immunoprecipitation (IP) of Fam134b in 
Fam134b*’* and Fam134b’~ MEFs with Lc3b. c, Domain architecture of 
FAM134B and alignment of the LIR motif. Blue, reticulon domain; red, 
LC3-interacting region (LIR); green, coiled-coil (CC) domain; Q145X and 


stress-induced apoptotic cell death and leads to degeneration of 
sensory neurons. Therefore, selective ER-phagy via FAM134 pro- 
teins is indispensable for mammalian cell homeostasis and con- 
trols ER morphology and turnover in mice and humans. 

Selective autophagy is a degradative pathway that controls the qual- 
ity and abundance of proteins and cellular organelles and is mediated 
by autophagy receptors that simultaneously bind the designated target 
and LC3/GABARAP proteins on autophagosomal membranes'*"”. To 
identify proteins involved in selective autophagy we performed yeast 
two-hybrid screens using LC3B (microtubule associated protein light 
chain 3B; MAP1LC3B) and GATE16/GABARAPL2 (y-aminobutyric 
acid receptor-associated protein-like 2) as baits’*. Members of the 
reticulon-homology-domain-containing FAM134 protein family were 
found to be interaction partners of LC3B and GABARAPI2 and this 
interaction was confirmed using FAM134B as the bait (Supplementary 
Table 1). Also, overexpressed and endogenous FAM134B directly 


FAM134B GFP-SEC61B Merge Inset 


FAM134B CLIMP-63 Merge Inset 


FAM134B RTN4—Myc Merge Inset 


S309X, disease-associated stop mutations. Residues with full (*), strong (: 

and weak (.) conservation are indicated. d, Endogenous FAM134B co- ae 
with ER-markers in A549 cells. Representatives of ten images are shown. 
Scale bar, 10 tum. e, Quantification of FAM134B co-localization presented as 
Pearson’s correlation coefficient (r); ***P < 0.001, one-way ANOVA, 

n = 50 cells. 
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Figure 2 | FAM134B targets ER into autophagosomes. a, b, Stable KDEL- 
RFP U20S cells transiently expressing FAM134B-—HA or FAM134B(mutLIR)- 
HA were quantified for KDEL-RFP- and LC3B-positive punctae (arrowheads). 
Representatives of ten images are shown. Scale bars, 10 jim. **P < 0.01, 
one-way ANOVA, error bars indicate s.d., n = 300. NS, not significant. 

c-f, Transmission electron microscopy of U2OS control cells (Ctrl) and stable 
eGFP-FAM134B cells. c, d, Autolysosomes indicated by arrowheads. 

e, Asterisks label ER fragments inside autolysosomes. f, Immunogold labelling 
against CD63 (large dots, diameter 15 nm) and KDEL peptide or GFP (small 
dots, diameter 10 nm). Treatment with bafilomycin is indicated (+Baf). AL, 


interacts with LC3/GABARAP in mammalian cells (Fig. la, b and 
Extended Data Fig. la, b). Similar results were obtained for 
FAM134A and FAMI134C proteins (Extended Data Fig. Ic). 
FAM134 proteins contain a conserved putative LC3-interacting region 
(LIR motif) (Fig. 1c). Substitution of amino acids of this LIR motif with 
alanines (DDFELL/AAAAAA; mutLIR) completely abolished the 
interaction of FAM134B with LC3-like modifiers, whereas deletion 
of the C-terminal coiled-coil domain (ACC) did not affect LC3 binding 
(Extended Data Fig. 1d). Truncated FAM134B proteins (Q145X, 
S309X) causing sensory neuropathy in humans” lack the C-terminal 
LIR motif and consequently did not bind LC3-like proteins (Extended 
Data Fig. 1d). Moreover, the crystal structure of a LIR-motif- 
containing FAM134B peptide bound to LC3A further demonstrated 


autolysosome; M, mitochondria; N, nucleus; PM, plasma membrane. Scale 
bars, 500 nm (c, d), 200 nm (e, f); m = 150 cells for c-e, n = 100 cells for 
f-h. g, h, U2OS cells stably expressing mCherry-eGFP-FAM134B or 
mCherry-eGFP-FAM134B(mutLIR). Cells that exhibited lysosomal mCherry- 
positive but eGFP-negative punctae (arrowheads) were quantified. Scale 
bars, 10 jum. **P < 0.01, one-way ANOVA, error bars indicate s.d., three 
biological replicates. i, Quantification of mCherry-positive and eGFP-negative 
punctae in wild-type and Atg5“" MEFs transiently expressing mCherry- 
eGFP-FAM134B. **P < 0.01, one-way ANOVA, error bars indicate s.d., 

n = 60 cells, three biological replicates. 


that amino acids F455 and L458 of FAM134B mediate the interaction 
with the hydrophobic pockets of LC3A (Extended Data Fig. le)'*”°. 
These data thus show a direct interaction of FAM134 proteins with the 
autophagy machinery and identify a potential novel family of auto- 
phagy receptors. 

Endogenous and overexpressed FAM134B co-localized with 
CLIMP-63 and SEC61B, marker proteins of sheet-like cisternal ER 
and to a lesser extent with RTN4-positive tubular ER (Fig. 1d, e and 
Extended Data Fig. 2a—c). We found that both the N-terminus and the 
LIR-motif-containing C-terminus, which flank the reticulon homo- 
logy domain of FAM134B, face the cytoplasm, suggesting a hairpin 
topology of the protein’*”” (Extended Data Fig. 2d, e). Liposome-bind- 
ing assays confirmed that FAM134B binds to membranes (Extended 
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Data Fig. 3a) and its incubation with liposomes resulted in a reduced 
liposome size (Extended Data Fig. 3b, c). Neuropathy-associated pro- 
teins lacking both the coiled-coil domain and the LIR-motif (S309X) 
showed weaker binding to membranes but retained the ability to shape 
them, whereas deletion of a part of the reticulon domain (Q145X) 
abolished the formation of small-diameter liposomes (Extended 
Data Fig. 3c). These data indicate that FAM134B is capable of remod- 
elling membranes in vitro via its reticulon homology domain. 

Transient overexpression of FAM134B in U20S cells stably expres- 
sing the ER marker protein KDEL-RFP led to pronounced morpho- 
logical changes and fragmentation of the ER. FAM134B-positive ER 
fragments co-localized with LC3B and CD63-positive lysosomes 
(Fig. 2a-f). In contrast, the ability of FAM134B to fragment and 
sequester ER into LC3-positive autophagosomes was abolished 
upon expression of a LIR-motif-deficient FAM134B (Fig. 2a, b). 
Overexpression of RTN4A, another reticulon-domain-containing 
protein’’, induced morphological ER changes, but did not lead to a 
formation of LC3B/KDEL-RFP-positive punctae (Fig. 2b). These 
observations suggest that FAM134B, through its LIR motif, acts as 
an ER-selective autophagy receptor. To dissect the fate of FAM134B- 
induced ER fragments, we performed transmission electron micro- 
scopy (Fig. 2c-f). Lysosomes and autolysosomes were more abundant 
in U2OS cells stably overexpressing eGFP-FAM134B compared 
to controls, an effect which was even more prominent upon inhibi- 
tion of lysosomal degradation by bafilomycin Al (Fig. 2c-e). 
Autolysosomes in bafilomycin Al-treated and eGFP-FAM134B- 
expressing cells showed an accumulation of ribosome-decorated ER 
fragments in their interior (Fig. 2e). The presence of ER-derived struc- 
tures in autolysosomes was confirmed by double immunogold labelling 
using antibodies against the lysosomal marker CD63 and the KDEL 
peptide (Fig. 2f). Double labelling with CD63 and GFP antibodies 
proved that eGFP-FAM134B was delivered into autolysosomes 
(Fig. 2f). ER turnover was further investigated by monitoring the degra- 
dation of the ER-resident protein SEC61B. Increased degradation of 
SEC61B was observed in FAM134B-overexpressing cells but not in 
cells expressing FAM134(mutLIR) (Extended Data Fig. 4a). Delivery 
of ER fragments into lysosomes was further confirmed in U2OS cells 
stably expressing FAM134B with a tandem fusion of mCherry and 
eGFP (mCherry-eGFP-FAM134B), as indicated by mCherry-positive 
but eGFP-negative punctae, owing to the higher stability of mCherry 
in acidic milieux’’ (Fig. 2g, h). In contrast, only few mCherry-positive 
and eGFP-negative punctae were observed in cells stably expressing 
mCherry—-eGFP-FAM134B(mutLIR) (Fig. 2g, h) or in autophagy- 
deficient Atg5’~ mouse embryonic fibroblasts (MEFs) expressing 
wild-type mCherry-eGFP-FAM134B (Fig. 2i and Extended Data 
Fig. 4b). Moreover, ER fragmentation and degradation was abolished 
after FAM134B overexpression in AtgS’ MEFs (Extended Data 
Fig. 4c-f). These data demonstrate that FAM134B is an ER-anchored 
autophagy receptor that mediates ER delivery into lysosomes through 
sequestration into autophagosomes (ER-phagy). Notably, this mech- 
anism is different from a recently described ER-phagy pathway in yeast 
that operates independently from the core autophagy machinery’’. 

We next performed short hairpin RNA (shRNA)-mediated 
FAM134B knockdown in U2OS cells stably expressing KDEL-RFP. 
FAM134B downregulation led to a striking ER expansion especially in 
the cell periphery (Fig. 3a-d and Extended Data Fig. 5a). Knockdown 
of two major autophagy regulators, ATG5 and BECNI, likewise trig- 
gered a comparable phenomenon (Fig. 3e-h). The ER expansion was 
accompanied by an increase in both CLIMP-63-positive membranes 
and increased cellular levels of CLIMP-63 and TRAP« proteins 
(Extended Data Fig. 5b-d). Reconstitution of knockdown cells with 
shRNA-resistant FAM134B variants (wild type, mutLIR, Q145X, 
S309X or Aret (lacking the reticulon domain)) revealed that only 
wild-type FAM134B rescues the ER-phenotype (Fig. 3i, j). Thus, ER 
remodelling and turnover by FAM134B requires both the reticulon 
domain and LIR-mediated targeting to autophagosomes. 
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Figure 3 | FAM134B and autophagy regulate ER volume and structure. 
a-d, ER expansion in cells lacking FAM134B. a, shRNA-mediated knockdown 
of FAM134B in U20S cells stably expressing KDEL-RFP. Representatives of 
ten images are shown. Scale bars, 10 tum. b, Knockdown efficiency determined 
by western blot. c, Quantification of cells with expanded ER. **P < 0.01, 
one-way ANOVA, error bars indicate s.d., n = 300. d, Quantification of ER area. 
**P < 0.0001, Mann-Whitney test, n = 25. e-h, Defective autophagy leads to ER 
expansion. e, shaRNA-mediated knockdown of ATG5 or BECN1 in U2OS cells 
stably expressing KDEL-RFP. Representatives of ten images are shown. Scale bars, 
10 um. f, Knockdown efficiency determined by western blot. g, Quantification of 
cells with expanded ER. **P < 0.01, one-way ANOVA, error bars indicate s.d., 
n = 300. h, Quantification of ER area. **P < 0.0001, Mann-Whitney test, n = 25. 
i,j, Rescue of ER expansion in FAM134B-knockdown U20S cells after expressing 
shRNA-resistant FAM134B proteins. i, Quantification of cells with expanded 
ER. **P < 0.01, one-way ANOVA, error bars indicate s.d., n = 300. 

j, Quantification of ER area. **P < 0.0001, Mann-Whitney test, n = 25. 


To study the consequence of Fam134b loss in vivo, we disrupted the 
mouse Fam134b gene by homologous recombination (Fig. 4a). An 
expanded ER and alterations of Climp-63-positive ER sheets were 
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Figure 4 | Impaired selective ER turnover and sensory neuropathy in 
Fam134b~“ mice. a, Cre-lox-mediated deletion of exons 6-8 of Fam134b 
(loxP site, black triangle). b, Expanded ER in Fam1 34b- MEFs. **P < 0.01, 
one-way ANOVA, error bars indicate s.d., n = 300. Representative of ten 
images is shown. Scale bars, 10 Jum. ¢, Protein turnover in MEFs after Earle’s 
Balanced Salt Solution (EBSS) starvation. d, e, Densitometry of ¢ normalized 
to vinculin in three (d) or four (e) independent experiments. *P < 0.05, **P < 
0.01, one-way ANOVA. f, Tail withdrawal latency from noxious heat stimulus 
(tail-flick assay, 45 °C). NS, not significant, *P < 0.05, Mann-Whitney U-test, 
error bars indicate s.e.m. (n = 13 for Fam134b*'*, n = 11 for Fam134b). 
g, Sensory nerve conductance velocities (NCVs) recorded from the mouse tail. 
*P < 0.05, one-way ANOVA (n = 6 for both Fam134b*'* and Fam134b~ at 


observed in MEFs isolated from Fam134b~ mice (Fig. 4b). However, 
autolysosome number, turnover of long-lived proteins, Lc3b proces- 
sing and p62 degradation were unaltered in Fam134b’” MEFs, sug- 
gesting that although Fam134b specifically affects ER morphology and 
turnover via autophagy proteins, it is dispensable for basal or induced 
bulk autophagy and other types of selective autophagy (Extended Data 
Fig. 6). Of note, cytoplasmic protein aggregates that accumulated after 
2h of treatment with puromycin were removed during a 3 h recovery 
in both control and Fam134b’~ MEFs (Extended Data Fig. 6m, n). 
Nutrient starvation has previously been shown to induce ER degrada- 
tion®"’, accordingly, starvation triggered rapid Fam134b degradation 
in MEFs (Fig. 4c and Extended Fig. 4f). To a lesser extent we observed 
starvation-induced degradation of other structural ER proteins includ- 
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6-8 months; n = 10 for Fam134b*!* and n = 12 for Fam134b'~ at 22-24 
months). h, Input-output curves of sensory nerve amplitudes at 12-15 months 
(n = 13 for Fam134b*'* and n = 11 for Fam134b). **P < 0.005, two-way 
ANOVA, error bars indicate s.e.m. i, Afferent sensory fibres cut distally to the 
dorsal root ganglia were quantified. (n = 7 for Fam134b*'* and n = 7 for 
Fam134b~/ ~). NS, not significant, *P < 0.05, Student’s t-test, error bars indicate 
s.e.m. j, k, Transmission electron miscroscopy images showing enlarged ER 
tubules and distorted Golgi cisternae in dorsal root ganglia tissue sections 
(45-week-old Fam134b"" mice). Arrowheads indicate representative examples 
for ER (j) and Golgi compartment (k). Scale bars: 1 1m (j); 0.5 um (k).n = 3 
animals for each genotype. M, mitochondria. 


ing Climp-63 and Rtn4 (Fig. 4c-e). Importantly, turnover of Climp-63 
and Rtn4 was significantly impaired in Fam134b’ MEFs upon nutri- 
ent starvation (Fig. 4c-e), revealing that Fam134b acts as a general 
autophagy receptor for ER-phagy and may contribute to the dynamics 
of both Climp-63-positive sheets and Rtn4-enriched tubules/edges of 
sheets. Impaired ER turnover in Fam134b-deficient MEFs and A549 
cells affected cell viability under various stress conditions and led to cell 
death by involving the mitochondrial apoptotic pathway (Extended 
Data Fig. 7). This, together with other reports’, suggests that a 
compromised ER homeostasis undermines long-term cell survival. 
Phenotypic analyses of aged Fam134b mice revealed a reduced 
sensitivity towards noxious heat, which indicates a loss of small noci- 
ceptive neurons (Fig. 4f). Sensory nerve conduction velocities were 
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reduced, and together with a severe reduction of the sensory amplitude 
suggested additional loss of myelinated axons (Fig. 4g, h). In line with 
this, sensory axon numbers of Fam134b~ mice were significantly 
reduced in aged animals (Fig. 4i). No changes of lower motor neurons 
were found, suggesting that neurodegeneration is restricted to peri- 
pheral sensory nerves (Extended Data Fig. 8a-f). Ultrastructural ana- 
lyses of the somata of sensory neurons in tissue sections revealed ER 
expansions in Fam134b- mice at 10 months of age, which were absent 
in wild-type littermates (Fig. 4j). Notably, Golgi cisternae were also 
dilated in 10-month-old Fam134b7~ animals (Fig. 4k). No obvious 
changes in other subcellular compartments were observed. The ultra- 
structure of sensory neurons before phenotypic disease onset was fur- 
ther examined by a transmission electron microscopy approach 
allowing the maintenance of the neuronal architecture”’. While neur- 
ites in cultured sensory neurons from 3-month-old Fam134b*/* and 
Fam134b“~ mice were indistinguishable, the ER in the cell body was 
swollen in approximately 60% of Fam134b ~~ neurons (Extended Data 
Fig. 9). The in vivo data support the role of Fam134b as an ER-phagy 
receptor and demonstrate its relevance for sensory axon maintenance. 

Insummary, we show that FAM134B is essential for ER homeostasis in 
vitro and in vivo, and that its reticulon domain and LIR motif are both 
required for ER-phagy (Extended Data Fig. 10). We propose that 
FAM134B promotes membrane remodelling and ER scission via the 
membrane-bending capacity of its reticulon domain, whereas the LIR 
motif is necessary to target the ER fragments to autophagosomes 
for lysosomal degradation. A similar coordination between organelle 
scission and autophagosomal sequestration has been proposed for 
mitophagy™*. 

Maintaining ER homeostasis via FAM134B-mediated autophagy is 
essential for cell survival, especially under stress conditions. This is 
particularly relevant in vivo, where FAM134B inhibition, coupled with 
age-associated attenuation of autophagy, could contribute to an accu- 
mulation of misfolded or aggregated proteins within the ER leading to 
impaired proteostasis, compromised neuronal survival, and progress- 
ive neurodegeneration. An accompanying paper identified two novel 
receptors for selective ER-phagy in yeast termed Atg39 and Atg40, 
with the latter harbouring a reticulon domain”, and showed that 
ER-phagy is required for ER turnover and yeast cell fitness under stress 
conditions. Thus, our studies unveil a novel evolutionarily conserved 
molecular pathway that regulates the turnover and structure of the ER, 
and provide insight into selective autophagy in physiological organelle 
dynamics and pathological conditions. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Plasmids, antibodies and reagents. pcDNA3.1(+)-FAM134A-HA, pcDNA 
3.1(+)-FAM134B and pcDNA 3.1(+)-FAM134C were generated by subcloning 
FAM134A, FAM134B and FAM134C ORFs into pcDNA 3.1(+) (Invitrogen) 
from pOTB7-FAM134A (MHS1011-59267), pOTB7-FAM134B (MHS1011- 
9199640) and pBluescript-FAM134C (MHS1010-9204031), respectively (Thermo 
Scientific), using HindIII and Xhol cloning sites. pcDNA3.1(+)-FAM134A-HA 
construct contained a 50-nucleotide N-terminal deletion that did not affect the 
ORF. pcDNA3.1(+)-FAM134A/B(mutLIR) (DDFELLD LIR substituted by seven 
alanine residues) and pcDNA3.1(+)-FAM134B DH1/2 (lacking amino acids 
83-140 and 196-248) were generated by site-directed mutagenesis. pGEX4T1- 
hGABARAP(AG), pGEX4T1-hGABARAPLI(AG), pGEX4T1-hGABARAPL2 
(GATE-16)(AG), pGEX4T1-hMAP1LC3A(AG), pGEX4T1-hMAP1LC3B(AG) 
and pGEX4T1-hMAPI1LC3C(AG) have been described previously’. FAM134B 
was cloned into Gateway entry vector pDONR223 (Invitrogen) and then recom- 
bined further into pHage-eGFP (Invitrogen) by the gateway LR reaction 
(Invitrogen) to generate pHage-eGFP-FAMI134B as described’®. pHage-eGFP- 
FAM134B Aret (lacking amino acids 80-250), Q145X (stop codon at position 
145), S309X (stop codon at position 309), mutLIR (DDFELLD LIR motif 
substituted by alanine residues) and ACC (stop codon at position 464) were gen- 
erated by site-directed mutagenesis. FAM134B and the Aret mutant were deliv- 
ered into pcDNA3.1/N mCherry-eGFP and pET-60-DEST destination vectors 
by the Gateway LR reaction to generate mCherry-eGFP-FAM134B. pcDNA3.1/ 
N mCherry-eGFP was generated as follows: eGFP was added to pcDNA3.1/N 
mCherry destination vector (Invitrogen) through PCR amplification and liga- 
tion into pcDNA3.1/N mCherry using an Apal restriction site. To generate the 
FAM134B(mutLIR) double tag vector, DDFELLD LIR has been substituted 
with seven alanine residues. Other plasmids used in the study were peGFP- 
Cl (Clontech), KDEL-RFP (pmRFP-C1-KDEL-REP, a gift from J. Lippincott- 
Schwartz’), pcDNA3.1(+)-RTN4A (NogoA)-mycHis (a gift from S. 
Strittmatter”*), and GFP-SEC61 fusion (a gift from H.-P. Hauri and H. Farhan”). 
Mouse monoclonal antibodies. Anti-B-Actin (Abcam), anti-HA (Santa Cruz, sc- 
7392), anti-c-Myc (Santa Cruz, sc-40), anti-GFP (Santa-Cruz, sc-9996), anti-LC3B 
(MBL, M152-3), anti-mono- and -polyubiquitinylated conjugates (Enzo, BML- 
PW8810), anti-p62 (MBL, M162-3), anti-CLIMP-63 (Enzo, ALX-804-604), anti- 
LAMP1 (DSHB University of lowa, H4A3), anti-LC3 (NanoTools, 0260-100/LC3- 
2G6), anti-GM130 (BD Transduction Laboratories, 610823), anti-caspase-8 
(MBL, M058-3), anti-G-adaptin (BD Transduction), anti-vinculin (Sigma). 
Rabbit polyclonal antibodies. Anti-LC3B (MBL, PM063), anti-p62 (Enzo, 
PW9860) anti-dsRed (Clontech, 632496), anti-ATG5 (Cell Signaling, 2630), 
anti-BECN1 (Cell Signaling, 3495), anti-CKAP4 (ProteinTech, 16686-1-AP), 
anti-NogoA+B (Abcam, ab47085), anti-TRAPo (Abcam, ab133238), anti- 
PARP (Cell Signaling, 9542), anti-caspase-9 (Cell Signaling, 9502). 

Goat polyclonal antibody. Anti-Calregulin (Santa Cruz, sc-6467). 

For immunoelectron microscopy analyses, sections were labelled with a mono- 
clonal antibody anti-KDEL (Calbiochem) and a biotin-conjugated monoclonal 
antibody recognizing CD63 (Calbiochem). 

Secondary antibodies. HRP-conjugated secondary antibodies were purchased 
from Santa Cruz Biotechnology; Cy3- and Cy5-conjugated antibodies were from 
Jackson ImmunoResearch and Alexa488-conjugated antibodies were from 
Invitrogen. Secondary gold-labelled rabbit anti-goat, -mouse and -biotin antibod- 
ies were purchased from Nordic MUbio, Dako and Rockland Immunochemicals 
Inc., respectively. 

Chemicals. Bafilomycin A, MG132, CCCP and thapsigargin were purchased from 
Sigma. Tunicamycin was obtained from Santa Cruz while Ku-0063794 was from 
Axon Medchem. B-NGF was obtained from Preprotec. 

Antibody generation. Antisera were raised against synthetic peptide sequences 
chosen on the basis of predicted antigenicity and uniqueness within the family of 
FAM134 proteins. Peptides were synthesized with N- or C-terminal cysteine 
(indicated in brackets in the amino-acid sequences given below) and coupled to 
the keyhole limpet hemocyanin (KLH) carrier protein. For FAM134A, rabbits were 
immunized against the peptide GLGPEMPPKPPDVLP(C) (amino acids 503-517; 
NP_739561.2), for FAM134B against (C)GYTPQTDTSDDLDRP (amino acids 
307-321; NP_001030023.1) and (C)LPTELKRKKQQLDSAHR (amino acids 
352-368; NP_001030023.1), for FAM134C against (C)AFRGRRAMSGSWE 
RDQ (amino acids 18-33; NP_080777.1). Resulting antisera were purified on an 
affinity matrix made by conjugation with the same peptide. 

shRNA knockdown. Annealed oligonucleotides were cloned into pLKO.1puro” 
using Agel and EcoRI cloning sites. pLKO.1 puro shRNA control was provided by 
D. Popovic (see ref. 31). The following pairs of oligonucleotides were used to target 
specified genes: FAM134B no. 1, CCGGGAGGTATCCTGGACTGATAATCT 
CGAGATTATCAGTCCAGGATACCTCTTTTTG and AATTCAAAAAGAGG 
TATCCTGGACTGATAATCTCGAGATTATCAGTCCAGGATACCTC; FAM 
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134B no. 2, CCGGCACAAGGATGACAGTGAATTACTCGAGTAATTCACT 
GTCATCCTTGTGTTTTTG and AATTCAAAAACACAAGGATGACAGTG 
AATTACTCGAGTAATTCACTGTCATCCTTGTG; ATG5, CCGGCATCTG 
AGCTACCCGGATAATCTCGAGATTATCCGGGTAGCTCAGATGTTTTTG 
and AATTCAAAAACATCTGAGCTACCCGGATAATCTCGAGATTATCCG 
GGTAGCTCAGATG; BECN1, CCGGAAGATTGAAGACACAGGAGGCCTC 
GAGGCCTCCTGTGTCTTCAATCTTTTTTTG and AATTCAAAAAAAGAT 
TGAAGACACAGGAGGCCTCGAGGCCTCCTGTGTCTTCAATCTT. 
Liposome-shaping assay. The liposome-shaping assay was performed as 
described previously”*. For freeze-fracture experiments, 1 mg of liposomes was 
incubated with 5 1M protein in liposome buffer containing 0.3 M sucrose for 15 
min at 37 °C. Small aliquots (1 to 2 11) of vesicle suspension were freeze-fractured. 
The samples were examined using an EM 902 A electron microscope (Zeiss). 
Images were acquired using a 1k FastScan CCD camera (TVIPS). Diameters of 
approximately 1,600 to 2,600 liposome structures per incubation condition were 
determined using ImageJ software. 

Liposome co-floatation assay. The liposome co-floatation assay was performed 
as described in ref. 20. Liposomes (150 ug) were incubated with 0.5 1M protein for 
15 min at 37 °C in 0.3 M sucrose in liposome buffer (25 mM HEPES-KOH, pH 7.2; 
25 mM KCI; 2.5 mM Mg-acetate; 100 mM K-glutamate) and mixed with 150 ul 
75% sucrose in liposome buffer. The mixture was overlaid with 200 pl 35% sucrose 
and 200 tl liposome buffer and then centrifuged at 200,000g for 30 min at 28 °C. 
Six fractions, each containing 100 Ll of eluate, were collected from top to bottom 
and analysed by SDS-PAGE and immunoblotting. 

Cell culture. Human Embryonic Kidney 293 cells ((HEK293T) (ATCC), U20S, 
A549, WT and Atg5’ ~ MEFs**)) were maintained at 37 °C with 5% CO, in DMEM 
medium (Gibco) supplemented with 10% fetal calf serum (Gibco) and 100 U ml“! 
penicillin and streptomycin (Invitrogen). Starvation was conducted by incubating 
cells in EBSS medium (Invitrogen). Cells were treated with 10 14M KU-0063794, 
200 ng ml’ bafilomycin Al, 1 1M thapsigargin and 2 tg ml! tunicamycin for 
the indicated time periods. Primary MEFs were isolated from embryos and 
immortalized with the SV40 large T antigen. All cell lines were checked for myco- 
plasma presence using Venor GeM Classic mycoplasma detection kit for conven- 
tional PCR (MB Minerva biolabs). Primary sensory neurons from mice were not 
tested for mycoplasma due to the short duration in culture. 

Generation of stable cell lines. To generate U2OS cells stably expressing KDEL- 
REP, mCherry-eGFP-FAM134B and mCherry-eGFP-FAM134B(mutLIR) cells 
were transfected with GeneJuice (Merck), followed by selection in DMEM med- 
ium containing 600 jg ml * geneticin (G418, Invitrogen) for 15-21 days. 

U20S cells stably expressing shRNA oligonucleotides were generated by lenti- 
viral infection as described previously’. A shRNA-resistant pHage-eGFP- 
FAM134B variant (against shRNA FAM134B #1) was generated by site-directed 
mutagenesis and was stably introduced in U2OS cells via lentiviral transduction 
and selection in medium containing 10 pg ml blasticidin (Invitrogen). 
Apoptosis analysis by FACS. Apoptosis assays on wild type and Fam134b— 
MEFs were performed using Annexin V-FITC Apoptosis detection Kit (Enzo 
Life Sciences) according to the manufacturer’s protocol. Numbers of annexin- 
V- and propidium-iodide-positive cells were detected using BDFACS Canto II 
and data were processed with FlowJo program. 

Protein aggregate clearance assay. Puromycin treatment (5 1g ml ') was carried 
out for 2 h in wild-type and Fam134b- MEBs. After treatment, cells were either 
fixed in 4% PFA for 10 min or washed three times with fresh medium, without 
puromycin, and incubated for a further 3 or 6 h prior to fixation and immuno- 
fluorescence analysis. 

Subcellular fractionation. Fractionation was performed similar to ref. 33. 
Samples were loaded on a linear 0-20% iodaxinol gradient. Centrifugation took 
place in a SW28 swing-out rotor for 115 min at maximum speed (28,000 r.p.m.). 
After centrifugation 600-1] fractions were taken (top to bottom) and subjected to 
western blot analyses. 

Protein expression and purification. FAM134B (450-459) and LC3A (2-120) 
chimaera was cloned into pGEX6P1 and expressed as glutathione-S-transferase 
(GST) fusion proteins in Escherichia coli BL21 DE3. Cells were lysed by sonication 
in lysis buffer (25 mM Tris, 100 mM NaCl, pH 8.5), and the lysate was cleared by 
centrifugation. The expressed protein was purified by glutathione-Sepharose 4B, 
cleaved by PreScission Protease, and further purified by size-exclusion chromato- 
graphy (HiLoad 16/600 Superdex 75 column, GE Healthcare Life Sciences) in 25 
mM Tris, 100 mM NaCl, pH 8.5. 

Crystallization, X-ray data collection and structure determination. FAM134- 
LIR-LC3A crystals were obtained using 8% Tacsimate (Hampton Research) pH 
8.0, 20% Poly ethylene glycol 3350 as a reservoir solution by the sitting-drop 
vapour diffusion method at 293 K. Data were collected at Diamond Light 
Source beam line 103 and processed with iMOSFILM. The crystal structure of 
FAM134B-LIR-LC3A was determined by molecular replacement with Molrep 
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using the LC3A structure (Protein Data Bank ID: 3ECI) as a search model. Manual 
model building and refinement were done with Coot and Phenix. The final stat- 
istics of refined models are shown in Supplementary Table 2 and the atomic 
coordinates have been deposited in the Protein Data Bank under accession num- 
ber 4ZDV. 

GST pull-down. HEK293T cells were transfected with indicated constructs using 
GeneJuice (Merck) for 24h. Cells were lysed in lysis buffer (50 mM HEPES, pH 7.5, 
150 mM NaCI, 1 mM EDTA, 1 mM EGTA, 1% Triton X-100, 10% glycerol, 25 
mM NaF supplemented with PMSF (Sigma), aprotinin (Sigma), leupeptin 
(Biomol), and sodium vanadate (Sigma)). Lysates were cleared by centrifugation 
at 12000g for 10 min, and incubated with GST-fusion-protein-loaded beads for 
2 h. Beads were then washed three times in lysis buffer, resuspended in Laemmli 
buffer and boiled. Supernatants were loaded on SDS-PAGE. 

Endogenous co-immunoprecipitation between FAM134B and LC3B. Three 
confluent 15-cm diameter Petri dishes of wild-type and Fam134b’~ MEFs were 
harvested in 50 mM Tris/HCl (pH 8), 120 mM NaCl, 1% (v/v) NP40, complete 
protease inhibitor cocktail (Roche), and 1 mM PMSF. Lysates were cleared by 
centrifugation at 12000g for 10 min, and incubated over night at 4°C with 
FAM134B primary antibody. Lysates and FAM134B antibody solutions were 
incubated with Protein A-Agarose beads (Roche) for 4 h at 4 °C. Beads were then 
washed three times in lysis buffer, resuspended in Laemmli buffer and boiled. 
Supernatants were loaded on SDS-PAGE. 

Long-lived protein degradation assay. The turnover of long-lived proteins upon 
autophagy induction in wild-type and Fam134b-knockout MEFs was measured as 
previously described™. 

Immunofluorescence microscopy. Cells were grown on coverslips and after each 
experiment fixed with 4% PFA for 20 min. Cells were permeabilized with 0.1% 
saponin in PBS for 30 min and blocked with 5% bovin serum albumin (BSA) in 
PBS for 1 h. Subsequently the coverslips were incubated in a wet chamber with 
primary antibody solution for 2 h, washed in the permeabilizing buffer and then 
incubated with secondary antibody solution for 1 h. Coverslips were finally washed 
three times and mounted on microscope slides using the mounting medium 
(Molecular Probes). Fluorescence signals were captured using a LSM510 META 
(Zeiss) or a Leica TCS SP8 confocal laser-scanning microscope. Images were 
processed using Image] software. Co-localization was analysed using Volocity 
Demo software (Perkin Elmer). 

Fluorescence protein protection (FPP) assay. The FPP assay was carried out 
with COS-7 cells transiently transfected with FAM134B-eGFP- (C-terminal eGFP 
tag), eGFP-FAM134B- (N-terminal eGFP tag), CD3-RFP-, or RFP-KDEL- 
expressing plasmids (Lipofectamine2000, Invitrogen). Twenty four hours after 
transfection cells were transferred to a perfusion chamber (37°C) and kept in 
live-cell buffer (50 mM HEPES, 23 mM NaCl, 3 mM MgCh, 100 nM CaCh, 
1mM EGTA, 107 mM glutamate, 1 mM ATP, 2 mM DTT). Images were recorded 
with a 20X objective every 20 s for 10 min at an inverted microscope (Axio 
Observer Z.1; AxioCam MRm, Zeiss). After plasma membrane permeabilization 
with digitonin (18 11M, 80 s) cells were washed for 80 s with live-cell buffer. 
Subsequently, trypsin (1 1M) was added to the cells. ImageJ was used for fluor- 
escence quantification measurements. Mean fluorescent values of single cells were 
subtracted from background fluorescence and normalized to fluorescence at the 
starting time point. 

Protein sequence alignment. The multisequence FAM134 protein sequence 
alignment was generated with ClustalW using the default settings. 

ER area measurements. Cells with expanded ER were visually defined if KDEL- 
REP-labelled ER occupied over approximately 80% of cell area. Additionally, the 
ER area measurements were conducted using ImageJ as follows. Background 
threshold was manually defined and set for all images. Borders of each cell were 
drawn and the ER area was then calculated and presented as a fraction of the total 
cell area. 

Ultrastructural analyses. For conventional TEM, U20OS cells transfected with 
either control or FAM134B shRNA, or overexpressing eGFP-FAM134B treated 
or untreated with 200 nM bafilomycinA1, were scrapped from culture flask and 
pelleted by centrifugation. Pellets were resuspended in 2% glutaraldehyde in 0.1 M 
cacodylate buffer (pH 7.4) for at least 2 h at room temperature. Fixative was 
replaced by cacodylate buffer and fixation was prolonged overnight. Cells were 
then embedded in Epon resin as previously described**. Subsequently, 55-nm 
sections were cut and stained with uranyl acetate and lead citrate using the EM 
AC-20 contrasting instrument (Leica Microsystems). For the immunoelctron 
microscopy analyses, U2OS cells stably expressing eGFP-FAM134B and grown 
to 80% confluence were treated with 200 nM bafilomycin A1 for 6 h. Cells were 
fixed in an equal volume as the culture medium by adding a freshly prepared mix 
of 4% PFA and 0.4% glutaraldehyde in 0.1 M phosphate buffer (pH 7.4) for 3 h, 
followed by a post-fixation in 2% PFA/0.2% glutaraldehyde for 2 h at room 
temperature and an overnight incubation at 4 °C. Cells were then embedded for 


the Tokuyasu procedure before cutting ultrathin cryo-sections and immuno-gold 
labelling as previously described’*. Cell sections were examined using an 80 kV 
transmission electron microscope (Jeol 1200-EX). 

To preserve the neuron native morphology, a monolayer culture of dorsal root 
ganglia (DRG) neurons was embedded in Epon resin in their original position in 
25-cm‘ plastic culture flasks following the flat embedding procedure”. After Epon 
resin polymerization at high temperature, the embedded material was stripped off 
the plastic flask using needle-nose pliers and forceps. Small circular specimens 
(=0.5 cm in diameter) of Epon-embedded material were cut using a drill bit 
(mounted with a rotating cutting tool) and glue onto supports for ultra-micro- 
tome. Ultrathin serial sections of 50 nm were cut parallel to the neuron monolayer 
using an ultra-microtome (Leica Microsystems) and transferred onto Formvar 
carbon-coated copper grids for staining using the EM AC-20 contrasting instru- 
ments (Leica Microsystems). 

Generation of Fam134b-knockout mice. A 10.4-kb fragment of the 129/SvJ 
mouse genomic lambda phage library (Stratagene) containing exons 5 to 9 of 
the mouse Fam134b gene was subcloned into the pKO-V901 vector containing 
a diphtheria toxin A (DTA) cassette (Lexicon Genetics). A pgk-promoter-driven 
neomycin resistance cassette (neo) flanked by FRT-sites and a 5’ loxP site was 
inserted into an EcoRV site in intron 5 of Fam134b. A second loxP site together 
with an additional 3’ EcoRI site was inserted into the Clal site of exon 8. The 
targeting construct was linearized and electroporated into R1 mouse embryonic 
stem (ES) cells. Homologous recombination was verified by Southern blot analysis 
of neomycin-resistant clones using EcoRI and an external 498-bp Southern probe. 
Correctly targeted embryonic stem cell clones were injected into C57BL/6 blas- 
tocysts to generate chimaeras. Resulting chimaeric mice were crossed with Cre- 
deleter mice to remove exons 6-8 of Fam134b. For PCR genotyping, DNA was 
isolated from tail biopsies by alkaline lysis and used in a single PCR mix containing 
the primers: F1, 5’-ACCCCATAGTTCATACTAGGC-3’; R1, 5’-CGTAACA 
GAGGTTGGTGAGG-3’; and F2, 5’-CATGGCAATGACATTTCTCC -3’. A 
product size of 280 bp was obtained for the wild-type allele and a 420-bp amplicon 
for the knockout allele. Studies were performed in a mixed 129SvJ/C57BL/6 back- 
ground in the F4 and F5 generations. Animals were chosen from the animal house 
database without prior inspection/pre-selection of animals. Both male and female 
mice were used, except for tail-flick experiments, whihc included only male mice. 
Age of the mice and sample size is given for each experiment. The experimenter 
was blinded to the genotype of the mice in all behavioural tests and for axon 
counting. Animal care and experimental procedures were performed in accord- 
ance with the guidelines established by the animal welfare committee of the State 
of Thuringia. 

FAM134B protein expression analysis. MEFs were prepared from embryonic- 
day-13.5 mouse pups as described previously*’. Primary MEFs were harvested in 
50 mM Tris/HCl (pH 8), 120 mM NaCl, 0.5% (v/v) NP40, complete protease 
inhibitor cocktail (Roche), and 1 mM PMSF. Homogenates were incubated on 
ice for 15 min and centrifuged for 30 min at 16,900g. Thrity micrograms of 
supernatant protein was subjected to SDS-PAGE and western blot analysis using 
an affinity-purified antibody against the Fam134b-specific peptide LPTELKRKK 
QQLDSAHR. 

Recording of tail sensory nerve conductance velocities. Mice were anaesthetized 
and body temperature was maintained with a warming pad. Tail sensory nerve 
conduction velocities (TSNCVs) were recorded 30 mm proximal to the stimu- 
lation electrodes. Sensory action potentials were evoked with increasing intensity 
(0-10 V, increment 1 V, 50 ts duration). Electrodes with a tip diameter of 1-2 um 
and an impedance of 0.1 MQ were used (WE30030.1H10, Science Products). 
Signals were processed with an extracellular amplifier (EXT 02F, NPI; high-pass 
filter: 3 Hz, low-pass filter: 2 kHz). The sample frequency was 20 kHz. For analysis 
Signal 3 software (CED) was used. TSNCVs were calculated by dividing the 
distance by the take-off latency of five averaged sensory potentials. Amplitudes 
were determined from peak to peak. 

Pain-related behavioural assays/tail-flick assay. Thermal stimulation (at 45 °C) 
was applied to the tail of adult male mice after their habituation to the test 
environment using the water immersion method”. The time from onset of stimu- 
lation to rapid movement of the tail was recorded. Two separate determinations of 
tail-flick latency per mouse were recorded and averaged. Statistical comparisons of 
two groups of data were made using the Mann-Whitney U-test. The experimenter 
was blinded to the genotype of the mice. 

Histological analysis of spinal cord and afferent sensory fibres. Murine tissue 
was fixed by in vivo perfusion of fixative solution containing 4% paraformaldehyde 
and 1% glutaraldehyde in PBS. The fourth lumbal DRG and spinal cord was 
isolated, post-fixed overnight at 4 °C in fixative and further processed for section- 
ing. Afferent sensory fibre bundles distal to the DRG were counted using Image] 
software. Cross-sections of the spinal cord with a thickness of 10 1m underwent 
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the cresyl violet staining protocol for quantification of Nissl-stained motor neuron 
cell bodies in the ventral horn. 

DRG cultures. Primary DRG neurons were prepared from adult mice killed via 
cervical dislocation. DRGs were extracted from the spinal cord and rinsed 
three times with Hank’s buffered salt solution (HBSS) followed by an incuba- 
tion in collagenase solution (3 mg ml” solved in HBSS) for one hour at 
37°C. Activated trypsin was added to a final concentration of 0.1% and 
DRGs were incubated for further 10 min at 37°C. DRGs were rinsed three 
times with HBSS and then dissociated by using BSA-blocked, fire-polished 
glass Pasteur pipettes of decreasing diameters. The single cell suspension 
was centrifuged for 5 min at 160g and supernatant was removed from the 
cell pellet. DMEM containing 10% horse serum was added to the cell pellet 
and again centrifuged for 5 min at 160g. The supernatant was carefully 
removed and cells were resuspended in Neurobasal-A medium supplemented 
with 2 mM L-glutamine and 2% B27. Cells were plated on poly-L-lysine coated 
cover slips or T25 cell culture flasks and placed at 37°C in a 5% CO, atmo- 
sphere. Thirty minutes after plating, medium was replaced by fresh culture 
medium containing B-NGF (50 ng ml~'). 

Ultrastructural analysis of DRG sensory neurons and motoneurons of the 
spinal cord in vivo. Tissue of DRG and spinal cord was fixed in 4% glutaralde- 
hyde. The tissues were post-fixed with osmium tetroxyde. Ultrathin sections were 
treated with uranyl acetate and lead citrate and viewed with a Philips CM10 
transmission electron microscope. 

Statistical analysis. Data are presented as mean + s.e.m. or s.d. Statistical analysis 
of two experimental groups was performed using one-way ANOVA or parametric 
two-tailed Student’s t-test after confirming a parametric distribution by Normal 
Q-Q plots. For non-parametric distributed data, the Mann-Whitney U-test was 
applied. If more than two groups were compared, one-way ANOVA was per- 
formed, and differences between subjects were analysed by a subsequent 
Bonferroni’s or Tukey honest significant difference test. If necessary, two-way 
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ANOVA was applied. Results were considered significant at P < 0.05. No statist- 
ical methods were used to predetermine sample size. 
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Extended Data Figure 1 | FAM134 proteins bind to GST-LC3-like 
modifiers. a, A549 cell lysates were added to beads with various immobilized 
GST fusion proteins (GST, GST-LC3A, GST-LC3B, GST-GABARAP, GST- 
GABARAP-L]), followed by western blotting using an antibody against 
FAM134B. b, Co-immunoprecipitation between Fam134b and Lc3b in wild- 
type (WT) mouse embryonic fibroblasts (MEFs). MEFs isolated from 
Fam134b-knockout (KO) mice served as control. c, Interaction of FAM134A 
and FAM134C with LC3-like modifiers. d, FAM134B proteins lacking the 
reticulon domain (Aret), disease-associated truncating FAM134B variants 
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Q145X and $309X, and FAM134B with a mutated LIR-motif (mutLIR) fail to 
bind GST-LC3-like modifiers (LC3B) in contrast to wild-type FAM134B 

and FAM134B lacking the coiled-coil domain (ACC). e, Crystal structure of 
a FAM134B-LIR peptide interacting with LC3A. Ribbon and surface represen- 
tation model of the FAM134B-LIR-LC3A interaction and crystallographic 
symmetry related molecule. (LIR, magenta; LC3A, yellow). 2F, — F, electron 
density (blue mesh) of FAM134B-LIR (amino acids 453-459), contoured at 1o 
and with ball-and-stick model. 
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Extended Data Figure 2 | Subcellular localization and topology of 
endogenous FAM134B. a, The post-nuclear fraction of wild-type MEF lysate 
was loaded on a linear Iodixanol gradient (0-20%). Fractions (top to bottom) 
were subjected to western blot and analysed with antibodies directed against 
endogenous proteins as indicated. Images are from two different gels, while the 
exposure time is the same. Asterik indicates non-specific band. b, A549 cells 
and MEFs were transfected with a KDEL-RFP expression plasmid for 24 h. 
After fixation, endogenous FAM134B was detected. FAM134B co-localizes 
with KDEL-RFP. Representatives of five images are shown. Scale bar, 10 Lm. 
c, HeLa and U20S cells were fixed and stained for endogenous FAM134B 
and CLIMP-63. FAM134B co-localizes with CLIMP-63. Representatives of 
five images are shown. Scale bar, 10 jim. d, FAM134B topology analysis. 
COS-7 cells transiently overexpressing C-terminal tagged FAM134B 
(FAM134B-eGFP) or N-terminal tagged FAM134B (eGFP-FAM134B) were 
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subjected to fluorescence protein protection (FPP) assay. COS-7 cells 
transfected with plasmids encoding the luminal ER peptide RFP-KDEL and 
C-terminal RFP-tagged CD3 (CD3-REP) served as controls. RFP-KDEL, 
which localized to the ER lumen, shows fluorescence protein protection upon 
trypsin administration following digitonin treatment. By contrast, according 
to the known topology of CD3-RFP the RFP-tag faces the cytosol and as a 
consequence trypsin treatment abolishes protein fluorescence. Scale bar, 

10 um. e, C-terminal RFP-tagged CD3 (CD3-REP) served as control (RFP-tag 
faces the cytosol). A N-terminal tagged FAM134B variant was also subjected 
to the FPP assay. A strong decrease in fluorescence was observed for CD3- 
REP, FAM134B-eGFP and eGFP-FAM134B after sequenced digitonin and 
trypsin treatment, but not for RFP-KDEL (n = number of cells, error bars 
indicate s.e.m.). 
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Extended Data Figure 3 | FAM134B is a membrane-shaping protein that 
remodels lipid bilayers. a, Liposome co-floatation assays. Proteins were 
detected using anti-GST antibodies in immunoblots of sucrose gradient 
fractions 1 (top) to 6 (bottom). GST-FAM134B, but not GST (Ctrl), floated 
with liposomes in fraction 2. Disease-related truncating mutations $309X and 
Q145X only partially floated with the liposome fraction. b, Representative 
transmission electron microscopy (TEM) images of freeze-fractured 
incubations of liposomes with recombinant GST-fusion proteins (GST- 
FAM134B; GST-FAM134B(Q145X); Ctrl, GST). Scale bar, 200 nm. 
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c, Distribution of liposome diameters observed by TEM of freeze-fractured 
liposome incubations. Incubation with FAM134B and S309X leads to a 
pronounced increase in the relative numbers of smaller structures in compari- 
son to control (GST) and Q145xX. Inset, box plots of data presented in ¢; y axis is 
logarithmic. n.s., not significant; ***P < 0.001 one-way ANOVA, error bars 
indicate s.e.m. (n = 2,682 for Ctrl, n = 2,683 for FAM134B, n = 1,685 for 
Q145X, n = 1,612 for S309X, n = number of liposomes). Boxes contain 

50% of the values; minimal, maximal, and median values are marked by 
vertical lines. 
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Extended Data Figure 4 | Fam134b determines ER degradation through 
autophagy. a, U2OS cells transiently co-transfected with 0.25 jug of GFP- 
SEC61 plasmid expressing full length or mutLIR FAM134B-HA at the 
indicated quantities (11g). Cell lysates were immunoblotted with antibodies 
against HA, GFP and vinculin. Black arrow-head indicates protein-degradation 
products. b, Wild-type (WT) and Atg5-knockout (KO) MEFs transiently 
expressing mCherry-eGFP—FAM134B were fixed and processed for immuno- 
fluorescence analysis. Cells with mCherry-positive and simultaneously GFP- 
negative punctae were counted in three independent experiments (biological 
replicates). Representative of 50 images is shown. Scale bar, 10 Lm. ¢, d, 
Atg5-knockout and control MEFs were co-transfected with plasmids 
expressing full-length LIR (c) or mutLIR (d) FAM134B-HA and KDEL-RFP 
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for 24h, fixed and processed for immunofluorescence using antibodies against 
the HA tag and Lc3b. Overexpression of FAM134B-HA in wild-type but not 
in Atg5-knockout MEFs leads to the formation of punctae positive for both 
KDEL-RFP and Lc3b. Representatives of five images are shown. Scale bar, 

10 jim. e, Quantification of cells displaying KDEL-RFP- and Lc3b-positive 
punctae shown in c and d. At least 50 cells per experiment from three 
independent experiments (biological replicates) were quantified (error bars 
indicate s.d.). f, Fam134b is stabilized in starved Atg5-knockout MEFs. Wild- 
type and Atg5-knockout MEFs were starved in EBSS for the indicated periods 
of time. Cell lysates were processed by SDS-PAGE and western blot using 
antibodies against Fam134b and vinculin. 
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Extended Data Figure 5 | FAM134B knockdown causes ER expansions. 

a, Ultrastructural analysis of FAM134B-depleted cells. Constructs expressing 
control shRNA or &-FAM134B shRNA were lentivirally delivered into U2OS 
cells. Cells were chemically fixed and embedded with Epon resin. Longitu- 
dinal and transversal sections and a part of the nuclear membrane is shown. 
FAM134B-depleted cells display ER expansions, particularly in the cell 
periphery. The membrane of the nucleus of FAM134B-knockdown cells also 
undergoes expansion. M, mitochondria; N, nucleus; PM, plasma membrane. 
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Scale bars, 2 |1m (left images) and 500 nm (middle and right images) (n = 150 
cells). b, c, ER sheets are expanded in FAM134B-deficient cells. Constructs 
expressing control shRNA or o&-FAM134B shRNA were lentivirally delivered in 
U20S cells. After selection, cells were fixed, stained for CLIMP-63 and 
RTN4A and RTN4B and analysed by fluorescence microscopy. Representatives 
of five images are shown. Scale bar, 10 jm. d, CLIMP-63 and TRAPz levels 
in autophagy-deficient and FAM134B-depleted cells. 
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Extended Data Figure 6 | FAM134B is not required for bulk autophagy and 
aggrephagy. a, Fam134b‘/* and Fam134b- MEFs were starved in EBSS for 2 
h, fixed and stained for Lc3b (using two different anti-Lc3b antibodies to 
enhance the signal). Representatives of five images are shown. Scale bar, 10 um. 
b, Lc3b-positive punctae in 50 cells per experiment from three independent 
experiments were counted. *P < 0.0001, one-way ANOVA. c, Long-living 
proteins degradation assay. For autophagy induction, cells were starved with 
EBSS. Protein degradation was assessed in three independent experiments in 
triplicate. Error bars represent s.e.m. of three independent counts (three 
technical replicates, n = 20 cells per each replicate, P value is calculated using 
t-test, *P < 0.05). d, Transmission electron microscopy of Fam134b*’/* and 
Fam134b~“~ cells. Autophagosomal/degradative compartments (that is, 
autophagosomes, autolysosomes and lysosomes) were counted. Quantification 
was performed by counting 20 cells in three different grids (three biological 
replicates). Error bars represent s.d., P value is calculated using t-test. e, Lc3b 
lipidation and p62 degradation was analysed in Fam134b*/* and Fam134b~ 
MEFs treated with 200 nM bafilomycin A1 for the indicated time. f, g, For 
autophagy induction, cells were starved with EBSS (f) or treated with the 
chemical Ku-0063794 (10 41M) (g) for the indicated periods of time. 


Bafilomycin A1 was added 1 h before the beginning of the treatment. h, Control 
and FAM134B-knockdown (shRNA-mediated) U2OS cells were starved in 
EBSS for 2 h, fixed and stained for LC3B (using two different anti-LC3B 
antibodies to enhance the signal). Representatives of five images are shown. 
Scale bar, 10 jim. i, LC3B-positive punctae in 150 cells per experiment from 
three independent experiments were counted. *P < 0.0001, Mann-Whitney 
U-test. j, LC3B lipidation and p62 degradation were analysed in control and 
FAM134B shRNA cells treated with 200 nM bafilomycin A1 for the indicated 
time. k, 1, For autophagy induction, cells were starved with EBSS (k) or treated 
with the chemical Ku-0063794 (10 [1M) (1) for the indicated periods of time. 
Bafilomycin Al was added 1 h before the beginning of the treatment. 

m, Fam134b*’* and Fam134b’~- MEFs were either treated with 5 ug ml! 
puromycin for 2 h or treated and subsequently washed and incubated in 
puromycin-free medium for 3 h, fixed and stained for ubiquitin (Ub) and p62. 
Representatives of five images are shown. Scale bar, 10 im. n, The number of 
cells with Ub/p62 punctae before and after puromycin release per 100 cells 
per experiment was determined from three independent experiments 
(biological replicates). Error bars indicate s.d. 
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Extended Data Figure 7 | FAM134B deficiency sensitizes cells to apoptosis. 
a, b, Fam134b*’* and Fam134b’- MEFs were treated with 25 pM CCCP 
(carbonyl cyanide 3-chlorophenylhydrazone) (a) or starved in EBSS(b) for the 
time indicated. Cell lysates were processed by SDS-PAGE and western blot 
using antibodies against vinculin, Parp and Fam134b. c, Fam134b*’* and 
Fam134b" MEEs were treated with 2 jg ml’ tunicamycin (Tm) or 1 pM 
thapsigargin (Tg) for 12 h or left untreated. Cell lysates were processed by SDS- 
PAGE and western blot using antibodies against vinculin and Parp. d, FACS 
analysis for annexin V and propidium iodide (PI) in Fam134b*’* and 
Fam134b~ MEFs. Quantifications of annexin V/propidium iodide-positive 
cells after the indicated experimental settings (EBSS starvation for 8 h, 
treatment with 1 1M thapsigargin, 2 wg ml * tunicamycin, 25 WM CCCP for 
12 h and 200 nM staurosporin (STS) for 6 h). Data are shown as mean = s.d. of 
two independent biological replicates; for each biological replicate two 
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experiments were performed and for each experiment 10,000 cells were 
analysed. **P < 0.01, one-way ANOVA. e, A549 cells stably expressing control 
and anti-FAM134B no. 1 and no. 2 shRNAs were either starved in EBSS or 
treated with 10 1M Ku-0063794, 200 ng ml! bafilomycin Al (BafA), 1 uM 
thapsigargin or 2 pg ml tunicamycin for 12 h or left untreated. Cell lysates 
were processed by SDS-PAGE and western blot using antibodies against 
vinculin and PARP. f, g, FAM134B knockdown induces apoptosis involving the 
mitochondrial pathway. A549 cells stably expressing control and anti- 
FAM134B no. 2 shRNAs were starved in EBSS for the time indicated. Cell 
lysates were processed by SDS-PAGE and western blot using antibodies against 
vinculin, PARP and caspase 8 (f) or caspase 9 (g). a—c, e-g, Filled arrowheads 
indicate processed PARP and caspase 8/9; empty arrowheads indicate 
unprocessed PARP and caspase 8/9. 
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Extended Data Figure 8 | Fam134b deficiency does not affect spinal cord 
motor neurons. a, Western blot analysis of mouse embryonic tissue lysates 
with a Fam134b antibody. b, Transmission electron microscopy analysis of 
Fam134b*/* and Fam134b“~ MEFs. Cells lacking Fam134b display an 
expanded ER. M, mitochondria. Scale bar, 1 jum; n = 150 cells. c, Compound 
muscle action potential (CMAP). CMAP latencies were recorded from tail 
nerves of wild-type (+/+) and knockout (—/—) mice at the ages indicated. No 
significant differences were observed between Fam134b'’* and Fam134b /~ 
mice. One-way ANOVA, error bars indicate s.e.m.; n = 6 for +/+ and —/— at 
6 months; n = 10 for +/+ and n = 13 for —/— at 12 months. n.s., not 
significant. d, Motor neurons in Nissl-stained thoracic spinal cord sections (10- 
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um thick) of Fam134b*/* and Fam134b/~ mice. Scale bar, 20 um. 
Representatives of seven images per genotype are shown. e, Quantification of 
motor neuron cell bodies in the ventral horn. Motor neuron number was 
unchanged in Fam134b*/* and Fam134b ’~ mice at an age of 12 and 20 
months (n = 7 for +/+ and n = 7 for —/—). n.s., not significant, Student’s 
t-test, error bars indicate s.e.m. f, Normal ultrastructure of motor neurons in 
spinal cord sections of 12-month-old mice. Arrows in the middle and right 
panels highlight that there were no observed alterations in ER and Golgi 
architecture, respectively. Scale bar, 1 um. Representatives of three images per 
genotype are shown. 
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Extended Data Figure 9 | Ultrastructural analysis of dorsal root ganglia 
neurons in Fam134b~’~ mice. a-e, Ex vivo analysis of dorsal root ganglia 
(DRG) neurons. DRG neurons from 3-month-old Fam134b*/* and 
Fam134b /~ littermates were cultured for 2 days before transmission electron 
microscopy using the flat embedding approach. a, b, The ultrastructural 
architecture of the peripheral ER as found in the DRG neuron cell body above 
or below the nucleus (a) and ER (b) adjacent to the nucleus of DRG neurons is 
shown. Scale bars: a, 500 nm; b, 200 nm. ¢, Representative examples of 

Golgi compartments. Scale bar, 200 nm. (n = 25 cells for a-c). d, e, Lateral 


Fomigan”” 


Ponto” 


cross-sections of the axons recognizable by their heavy myelination around the 
plasma membrane and their emptiness in organelles. Panel d shows a bundle 
of axons whereas panel e presents a single axon. e, lower panels, longitudinal 
cross-sections of neurites, which are typically packed with microtubules and 
ER. ER appears as thin black stripes in these images. ax, axon; ER, endoplasmic 
reticulum; G, Golgi; M, mitochondrium; mt, microtubule; my, myelin; N, 
nucleus, LD, lipid droplet. Scale bars: d, 1 jim; e, upper panels, 1 um; lower 
panels, 500 nm. Representatives of 25 images per genotype are shown. 
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Extended Data Figure 10 | Model of FAM134B function. To drive ER- subsequent budding of ER-derived vesicles. Mature autophagosomes fuse with 
phagy, FAM134B clusters at the edges of cisternal ER. Local enrichment of lysosomes leading to the degradation of enclosed ER fragments. 
FAM134B LIR leads to the recruitment of autophagosomal membranes and 


©2015 Macmillan Publishers Limited. All rights reserved 


des A A lea 


doi:10.1038/nature14506 


Receptor-mediated selective autophagy degrades 
the endoplasmic reticulum and the nucleus 


Keisuke Mochida!, Yu Oikawa?, Yayoi Kimura’, Hiromi Kirisako'*, Hisashi Hirano’, Yoshinori Ohsumi? & Hitoshi Nakatogawa'"* 


Macroautophagy (hereafter referred to as autophagy) degrades 
various intracellular constituents to regulate a wide range of cellular 
functions, and is also closely linked to several human diseases’. 
In selective autophagy, receptor proteins recognize degradation 
targets and direct their sequestration by double-membrane vesicles 
called autophagosomes, which transport them into lysosomes or 
vacuoles’. Although recent studies have shown that selective auto- 
phagy is involved in quality/quantity control of some organelles, 
including mitochondria and peroxisomes’, it remains unclear how 
extensively it contributes to cellular organelle homeostasis. Here we 
describe selective autophagy of the endoplasmic reticulum (ER) and 
nucleus in the yeast Saccharomyces cerevisiae. We identify two 
novel proteins, Atg39 and Atg40, as receptors specific to these path- 
ways. Atg39 localizes to the perinuclear ER (or the nuclear envel- 
ope) and induces autophagic sequestration of part of the nucleus. 
Atg40 is enriched in the cortical and cytoplasmic ER, and loads 
these ER subdomains into autophagosomes. Atg39-dependent 
autophagy of the perinuclear ER/nucleus is required for cell survival 
under nitrogen-deprivation conditions. Atg40 is probably the func- 
tional counterpart of FAM134B, an autophagy receptor for the ER 
in mammals that has been implicated in sensory neuropathy’. 
Our results provide fundamental insight into the pathophysiologi- 
cal roles and mechanisms of ‘ER-phagy and ‘nucleophagy in 
other organisms. 

Autophagy receptors bind to Atg8/LC3 homologues on forming 
autophagosomal membranes to allow membrane expansion along 
the surface of degradation targets*®. To understand comprehensively 
the physiological roles of selective autophagy, we sought to identify 
Atg8-binding proteins in S. cerevisiae, which should include unknown 
autophagic receptors. Among proteins detected by mass spectrometry 
of Atg8 immunoprecipitates, we focused on two uncharacterized 
proteins, Ylr312c and Yorl52c, which were named Atg39 and 
Atg40, respectively. Atg39 and Atg40 are not homologous to each 
other, although both contain putative transmembrane domains 
(Fig. 1a). Yeast two-hybrid assays and immunoprecipitation analysis 
confirmed the interactions of Atg39 and Atg40 with Atg8 (Fig. 1b and 
Extended Data Fig. 1a, b). These interactions were weakened by muta- 
tions in the Atg8 pocket that binds to the Atg8-family-interacting 
motif commonly found in autophagy receptors’ (Extended Data 
Fig. la, c). Consistent with this, we identified Atg8-family-interacting 
motifs in Atg39 (W8-XX-V11) and Atg40 (Y242-XX-M245); muta- 
tions in these motifs abolished the interactions between these proteins 
and Atg8 (Fig. la, b and Extended Data Fig. 1b). In yeast, another 
feature of autophagy receptors is binding to Atg11, which recruits the 
autophagosome-formation machinery onto receptor-target com- 
plexes*. Our analysis suggested that Atg39 and Atg40 also interact with 
this protein, and also found a consensus Atg11-binding sequence’ in 
Atg39 (Fig. la and Extended Data Fig. 1a, d). These results suggested 
that Atg39 and Atg40 are bona fide autophagic receptors. 

Atg39 and Atg40 were dispensable for selective autophagy path- 
ways, which target vacuolar enzymes, mitochondria, and peroxisomes, 


and non-selective autophagy also occurred normally without these 
proteins (Extended Data Fig. 2a—d). The protein levels of Atg39 and 
Atg40 markedly increased when cells were treated with rapamycin, 
which mimics nitrogen starvation by inactivating the TORC1 
complex’ (Fig. 2a). A previous study reported that the ER is efficiently 
degraded via autophagy under these conditions”. In this study, we used 
the ER membrane protein Sec63 fused with GFP to monitor autophagy 
of the ER. Autophagic degradation of Sec63-GFP yielded vacuolar 
protease-resistant GFP fragments, which increased GFP fluorescence 
in the vacuole in fluorescence microscopy and were also detected by 
immunoblotting analysis (Fig. 2b, c). Our results suggested that 
macroautophagy of the ER induced by nitrogen deprivation or rapa- 
mycin treatment (hereafter referred to as ER-phagy, whereas a pre- 
vious study also described ER stress-induced microautophagy of the 
ER using the same term"®) involved an unknown receptor (Extended 
Data Fig. 3a—d). Knockout of either ATG39 or ATG40 partially blocked 
ER-phagy, and the double knockout almost completely blocked this 
pathway (Fig. 2b, c) (Atg] is a protein kinase essential for autophago- 
some formation’). Atg39 and Atg40 mutants deficient for binding to 
Atg8 or Atgl1 also showed defects in ER-phagy (Extended Data 
Fig. 3e). In addition, in cells lacking Ypt7, a GTPase required for 
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Figure 1 | Atg39 and Atg40 interact with Atg8 and Atgl1. a, Schematic 
diagram of Atg39 and Atg40. AIM, Atg8-family-interacting motif; 11BR, 
Atg11-binding region; HR, hydrophobic region predicted to be a transmem- 
brane domain. Alanine substitutions for red residues impair Atg39 or Atg40 
binding to Atg8 or Atg11 (see Fig. 1b and Extended Data Fig. 1b, d). b, Lysates 
(input) prepared from cells treated with rapamycin for 12 h were subjected to 
immunoprecipitation using anti-GFP antibody, and the immunoprecipitates 
(IP) were analysed by immunoblotting using anti-GFP or anti-HA antibody. 
Representative results of three technical replicates. WT, wild type. 
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Figure 2 | Atg39 and Atg40 are receptors for ER- 
phagy. a, Immunoblotting analysis of Atg39 and 
Atg40. PEP4 was disrupted to block autophagic 
degradation of these proteins. Pgk1, loading 
control; Rapa., rapamycin. Representative 
immunoblots of two technical replicates. 

b, Fluorescence microscopy images of cells treated 
with rapamycin (representative of at least 100 
cells). FM 4-64 stains the vacuolar membrane. 

c, Sec63—GFP degradation was analysed by 
immunoblotting using anti-GFP antibody. GFP’, 
GFP fragments generated by Sec63-GFP degrada- 
tion. Representative results from four technical 
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autophagosome-vacuole fusion", Atg8-positive autophagosomes 
containing Sec63-positive ER fragments accumulated when treated 
with rapamycin; this accumulation was abolished by double knockout 
of ATG39 and ATG40, although Sec63-negative autophagosomes were 
still observed (Extended Data Fig. 3f). Furthermore, overexpression of 
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without Atg40. d, e, Degradation of Hmg1-GFP (d) or Rtnl-GFP (e). Repre- 
sentative results from three technical replicates. The quantification results 
(see Methods) are shown as the mean + s.d. (n = 3). f-h, Immunoelectron 
microscopy (ImmunoEM) images (using anti-Kar2 antibodies) of cells treated 
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replicates. The quantification results (see Methods) 
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** P< 0,001 (unpaired two-tailed Student’s 
t-test). d, Fluorescence microscopy images of cells 
treated with rapamycin for 3 h (representative of 
at least 100 cells). Arrowheads, Atg8-positive 
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these proteins stimulated ER-phagy (Extended Data Fig. 4a). Both 
Atg39-GFP and Atg40-GFP localized to the ER (see the next para- 
graph) and also formed Atg8- and Atgl11-positive puncta near the 
vacuole, suggesting their localization to the pre-autophagosomal 
structure’ (Fig. 2d and Extended Data Fig. 4b). Finally, Atg39 and 
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Atg40 themselves were transported to the vacuole in cells undergoing 
ER-phagy (Extended Data Fig. 4c—e). On the basis of these results, we 
concluded that Atg39 and Atg40 are ER-phagy receptors. 

As demonstrated by fluorescence microscopy of Sec63-mCherry 
(Fig. 3a), the ER in yeast consists of two distinct subdomains, the 
perinuclear ER (pnER) and the cortical (cell periphery) ER (cER), 
and these domains are connected with cytoplasmic ER (cytoER)’’. 
Whereas Atg39 specifically localized to the pnER, a large proportion 
of Atg40 resided in the cER and cytoER, although weak localization to 
the pnER was also observed (Fig. 3a). The pnER localization of Atg39 
was confirmed by its co-localization with the pnER protein Hmg1 
(Fig. 3b). We also showed that Atg40 co-localizes well with the reti- 
culon Rtn1, which localizes to highly curved regions in ER membranes 
such as tubules and sheet edges”* but is excluded from ER sheets, in 
which Sec63 resides (Fig. 3c). In accordance with their characteristic 
localization, whereas ATG39 disruption severely impaired degradation 
of the pnER (Hmg1-GFP), ATG40 disruption caused a weaker defect 
in degradation of this subdomain (Fig. 3d and Extended Data Fig. 5). 
By contrast, degradation of the cER and cytoER (Rtn1-GFP) strongly 
depended on Atg40, but not Atg39 (Fig. 3e and Extended Data Fig. 5). 
These results suggest that Atg39 and Atg40 are differently involved in 
selective autophagy of these ER subdomains: Atg39 has a major role in 
pnER-phagy, whereas Atg40 is primarily responsible for CER/cytoER- 
phagy. 

We used electron microscopy to morphologically characterize the 
ER sequestered into the autophagosome. Cells lacking the vacuolar 
protease Pep4 accumulate autophagic bodies (inner autophagosomal 
membrane vesicles containing sequestered material) inside the vacu- 
ole’*. We immunogold-labelled the ER using antibodies against the 
lumenal protein Kar2 and closely investigated the contents of auto- 
phagic bodies (Fig. 3f-h, j and Extended Data Fig. 6a). Autophagic 
bodies enclosing Kar2-positive membranes together with a part of the 
cytoplasm were frequently observed and categorized into two types 
based on their morphologies: double rings that probably represent 


cross-sections of double-membrane vesicles (Fig. 3g and Extended 
Data Fig. 6b, c), and intricate structures that appear to be folded 
tubules or sheets (Fig. 3h, i and Extended Data Fig. 6d). Knockout of 
ATG39 or ATG40 specifically reduced autophagic bodies containing 
double-ring structures or those containing tubules/sheets, respectively 
(Fig. 3j), suggesting that double-ring structures and folded tubules/ 
sheets are derived from the pnER and the cER/cytoER, respectively. 
Taken together, our results suggest that Atg39 and Atg40 have central 
roles in loading the pnER as double-membrane vesicles and the cER/ 
cytoER as folded tubules/sheets, respectively, into autophagosomes 
(Extended Data Fig. 7). 

Double-membrane vesicles derived from the pnER contained Kar2 
signals in the intermembrane space (Fig. 3g), suggesting that this space 
had originally been the pnER lumen. The pnER is equivalent to the 
nuclear envelope. We hypothesized that the outer and inner mem- 
branes of double-membrane vesicles are derived from the outer and 
inner nuclear envelope/pnER membranes, respectively. Indeed, the 
inner nuclear membrane protein Srcl was also degraded in a manner 
similar to the outer nuclear membrane protein Hmg1 (Figs 3d and 4a 
and Extended Data Fig. 8a, b). This was also the case for the nucleolar 
protein Nop1 (Fig. 4b and Extended Data Fig. 8c). Immunoelectron 
microscopy detected Nop] signals inside double-membrane vesicles, 
although with low frequency (Fig. 4c and Extended Data Fig. 8d). 
These results suggest that Atg39 directs autophagic sequestration of 
double-membrane vesicles derived from the nuclear envelope/pnER, 
which also contain some intranuclear components. Therefore, Atg39- 
mediated selective autophagy should also be called nucleophagy, as 
well as pnER-phagy (Extended Data Fig. 7). Deletion of NVJ1, which is 
required for microautophagy of the nucleus’, did not affect Nop1 
degradation (Extended Data Fig. 8c), suggesting that Atg39-dependent 
nucleophagy is predominant in nucleolus degradation in our experi- 
mental conditions. 

When exposed to prolonged nitrogen starvation, atg39A cells 
showed abnormal morphology in the nucleus/pnER, and lost their 
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viability earlier than wild-type cells (Fig. 4d, e and Extended Data Fig. 
9a, b). Importantly, non-selective autophagy, which is also required for 
cell survival under these conditions!®, occurred normally in the mutant 
cells (Extended Data Fig. 2a, d). In addition, ATG39 deletion did not 
exaggerate starvation-induced death in cells lacking Atgl (Extended 
Data Fig. 9c). Atg39 mutant cells defective in the interaction with Atg8 
or Atgl1 also exhibited similar phenotypes (Extended Data Fig. 9d). 
These results suggest that atg39 mutants are susceptible to cell death 
under nitrogen-deprived conditions due to their inability to perform 
pnER-phagy/nucleophagy, and thus the physiological significance of 
this pathway. On the other hand, in atg40A cells, the CER was more 
densely reticulated than wild-type and atg39A cells, whereas the nuc- 
leus/pnER appeared to be normal (Fig. 4d, fand Extended Data Fig. 9e, 
f). However, no significant decrease in cell viability was observed in 
atg40A cells under nitrogen-starvation conditions (Fig. 4e); Atg40- 
dependent ER-phagy may be more important for cell survival or 
adaptation under other circumstances. 

The putative transmembrane domains of Atg40 show a similarity to 
those of reticulon-related proteins (Extended Data Fig. 10), consistent 
with the observation that Atg40 favours highly curved ER regions 
(Fig. 3c). An accompanying paper identified the reticulon-domain- 
containing protein FAM134B as a mammalian ER-phagy receptor, 
and showed that its disruption causes progressive sensory neuro- 
degeneration’. The domain architectures of Atg40 and FAM134B 
resemble each other (Extended Data Fig. 10). In addition, FAM134B 
localizes to the cytoplasmic ER network rather than the pnER’. These 
results suggest that FAM134B is the mammalian counterpart of Atg40. 
Atg39 homologues are found only in closely related yeast species. 
However, macroautophagy of the nucleus has been reported in other 
organisms, including mammals'”""’, and the molecular bases remain 
unclear. These organisms may possess Atg39 homologues with very 
low sequence similarity to the yeast protein, or non-homologous 
counterparts that share fundamental functions with Atg39, such as 
nuclear envelope localization and interaction with the autophago- 
some-formation machinery, as in the cases of autophagy receptors 
for mitochondria and protein aggregates*”’. 

In this study, we show that the ER and the nucleus are degraded by 
receptor-mediated selective autophagy. Our findings pave the way 
towards understanding the role of autophagy in the maintenance 
and regulation of ER and nuclear functions. However, several ques- 
tions remain to be answered. (1) The components in the ER and the 
nucleus that must be degraded have not yet been identified. (2) The 
purpose of ER-phagy and nucleophagy under nutrient-limiting con- 
ditions, whether it is elimination of deleterious material or recycling of 
degradation products to provide scarce molecules, remains unknown. 
(3) In the context of ER-phagy, cells require two receptors to degrade 
different ER subdomains, but the reasons for this are unexplored. 
Under certain situations, cells may use only one receptor to specifically 
degrade the corresponding subdomain. (4) On a mechanistic level, it 
remains to be determined whether the generation of ER fragments and 
nuclear-envelope-derived vesicles is coupled with, or separated from, 
their sequestration into autophagosomes. (5) Whether Atg39 and 
Atg40 are also involved in ER fragmentation and nuclear envelope 
vesiculation also requires investigation. Future studies should address 
these important issues. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized. The investigators were not blinded to allocation during 
experiments and outcome assessment. 

Yeast strains and media. Strains used in this study, derived from BY 4741, BJ3505 
or SEY6210 (refs 21-23), are listed in Supplementary Table 1. Gene disruption and 
tagging were performed by a standard PCR-based method**”. Yeast cells were 
grown at 30°C in YPD medium (1% yeast extract, 2% peptone and 2% glucose) or 
SD+CA medium (0.17% yeast nitrogen base without amino acids and ammonium 
sulphate (YNB w/o aa andas), 0.5% ammonium sulphate, 0.5% casamino acid, and 
2% glucose) with appropriate supplements. To induce ER-phagy and nucleo- 
phagy, cells grown to mid-log phase were treated with 200 ng ml~' rapamycin 
or incubated in SD-N medium (0.17% YNB w/o aa and as, and 2% glucose). 
Plasmids. pRS314-ATG39-6HA, pRS314-ATG40-6HA (used in Fig. 1b and 
Extended Data Fig. 1b), pRS426-ATG39-6HA, and pRS426-ATG40-6HA (used 
in Extended Data Fig. 3e) were constructed as follows. The coding sequences of 
ATG39-6HA and ATG40-6HA with their own promoters were amplified by PCR 
using genomic DNA prepared from YKM656 and YKM658, and the pairs of the 
oligonucleotides Sacl-ATG39pro-Fw (5’-AAAGAGCTCCTGTAGTTTCAAAA 
ACTGTG-3’)/HA-NotI-Rv (5'-AAAGCGGCCGCGCTGGCCGGGTGACCCG 
GCG-3') and Sacl-ATG40pro-Fw (5'’-AAAGAGCTCGTTCTCCACGGACGT 
TTTCT-3')/HA-NotI-Ry, respectively. These PCR products were digested with 
Sacl and NotI, and ligated into the single-copy vector pRS314 or the multi-copy 
vector pRS426 (ref. 26). pRS316-ATG39-EGFP and pRS316-ATG40-EGFP (used 
in Extended Data Figs 1d, 4e and 9d) were constructed similarly by inserting 
DNA fragments, which were obtained by PCR using genomic DNA prepared 
from YKM564 and YKM567, and the oligonucleotides SacI-ATG39pro-Fw/ 
ADH Iter-XholI-Rv (5’-AAACTCGAGCCGGTAGAGGTGTGGTCAAT-3’) and 
Sacl-ATG40pro-Fw/ADH1ter-Xhol-Rv, respectively, into pRS316. Mutations in 
ATG39 and ATG40 were introduced by site-directed mutagenesis using Quik 
Change kit (Agilent Technologies). To construct pRS316-atg8°'"-GFP and 
pRS316-atg8'?*4 8674 GHOA_GED (used in Fig. 1b and Extended Data Fig. 1b, c), 
the BamHI site was introduced immediately upstream of the stop codon of ATG8 
in pRS316-ATG8 (ref. 11), and then a GFP sequence was inserted into the site, 
resulting in pRS316-ATG8-GFP. The mutations were introduced into this plas- 
mid by site-directed mutagenesis to obtain the above plasmids. A control plasmid 
expressing GFP alone (used in Fig. 1b and Extended Data Fig. 1b, c, d) was 
constructed by removing the ATG8 sequence from pRS316-GFP-ATG8 (ref. 9) 
by site-directed mutagenesis. 

Yeast two-hybrid assay. Yeast two-hybrid analysis was performed as described 
previously”’. Full-length ATG39, ATG8, atg8” 2A ROTA” 4 TGI1, and truncated 
atg40 (Ser93-Asp256) were inserted into the pGAD-T7 (Clontech) or pGBD-C1 
(ref. 27) plasmids. The AH109 yeast cells transformed with the generated plasmids 
were grown in SC medium (0.17% YNB w/o aa and as, 0.5% ammonium sulphate, 
2% glucose, and appropriate supplements) lacking leucine and tryptophan, spotted 
on to SC (-Leu, -Trp) and SC (-Leu, -Trp, -His) agar plates, and grown at 30°C 
for 2-4 days. 

Immunoblotting analysis. Harvested cells were treated with 20% trichloroacetic 
acid on ice for 20 min, centrifuged at 15,000g for 5 min, washed with cold acetone, 
dried at room temperature, suspended in SDS sample buffer (100 mM Tris-HCl 
(pH 7.5), 2% SDS, 20 mM dithiothreitol (DTT), and 10% glycerol), and incubated 
at 65°C for 10 min. Urea sample buffer (37.5 mM MOPS-NaOH (pH 6.8), 2% 
SDS, 100 mM DTT, and 4 M urea) was used for immunoblotting analysis of Atg39. 
These cells were disrupted using a FastPrep-24 (MP Biomedicals) and 0.5-mm 
YZB zirconia beads (Yasui Kikai). Samples were separated by SDS-PAGE, and 
analysed by immunoblotting using antibodies against GFP (Clontech, 632381), 
HA (Roche, 11867431001), Pgkl (Invitrogen), Atgl1, Apel (our laboratory 
stocks), and mCherry (gifted from T. Endo). To calculate degradation of GFP- 
fused proteins (%), GFP’ intensity was divided by the sum of GFP’ and GFP-fused 
full-length protein intensities, and the values relative to that in wild-type cells 
are shown. 

Immunoprecipitation. Cells harvested by centrifugation were disrupted in IP 
buffer (50 mM Tris-HCl (pH 8.0), 150 mM NaCl, 5 mM EDTA, 5 mM EGTA, 
50 mM Naf, and 10% glycerol) containing 2 mM phenylmethylsulfonyl fluoride 
(PMSF), 2X Complete protease inhibitor cocktail (Roche), 0.1X PhosSTOP phos- 
phatase inhibitor cocktail (Roche), and 50 nM microcystin using a Multi-beads 
Shocker (Yasui Kikai) and 0.5-mm YZB zirconia beads. The lysates were solubi- 
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lized by rotation at 4°C for 30 min with 0.5% Triton X-100, and then cleared by 
centrifugation at 15,000g for 15 min. The supernatants were incubated with GFP- 
Trap_M beads (ChromoTek) at 4 °C for 2 h. The beads were washed with IP buffer 
containing 0.3% Triton X-100 three times, and then incubated in SDS sample 
buffer at 65°C for 10 min to elute bound proteins. 

Fluorescence microscopy. Fluorescence microscopy was performed using an 
inverted fluorescence microscope (Olympus, IX81) equipped with an electron- 
multiplying CCD camera, a 150 objective lens, a 488-nm blue laser, a 561-nm 
yellow laser, and appropriate mirrors and filters, as described previously**. Images 
shown in Fig. 4d and Extended Data Fig. 9b were obtained using a different 
microscope system”’. Vacuolar membranes and DNA were stained with 0.3 1M 
FM 4-64 (Invitrogen) and 0.5 pg ml~' Hoechst 33258 (Dojindo), respectively. 
Images were acquired using the MetaMorph software (Molecular Devices) and 
processed using MetaMorph and Adobe Photoshop CS6. Images shown in 
Extended Data Figs 3f and 9f were obtained by deconvolution of original images 
using AutoQuant X3. Intensities of Sec63-GFP at the periphery of cells were 
analysed by line scan function in Metamorph. 

Electron microscopy. Electron microscopy was performed by Tokai-EMA Inc. 
based on rapid freezing and freeze-fixation methods. Cells were sandwiched with 
copper disks and quickly frozen in liquid propane at —175°C using EM CPC 
(Leica). The frozen cells were treated with 2% OsO, and 2% distilled water in 
acetone (Extended Data Fig. 9a) or 0.5% tannic acid and 2% distilled water in 
acetone, and then with 2% OsO, in acetone (Fig. 3i and Extended Data Fig. 6b-d). 
Samples were dehydrated, and embedded in Quetol-651. Electron microscopy was 
performed using a transmission electron microscope (JEM-1400Plus: JEOL Ltd). 
For immunoelectron microscopy, frozen cells were treated with 0.2% glutaralde- 
hyde and 2% distilled water in acetone, and embedded in LR white. Ultrathin 
sections were stained with anti-Kar2 antibodies (gifted from T. Endo) and anti- 
Nop] antibody (GeneTex, 28F2), and secondary antibodies conjugated to 10-nm 
gold particles. Autophagic bodies containing Kar2-positive double-ring structures 
or tubules/sheets were counted in three independent experiments (more than 250 
autophagic bodies were examined in each experiment). 

Cell viability assay. Yeast cells grown to mid-log phase (OD¢o9 = 1.5) in SD+CA 
medium with appropriate supplements were shifted to SD-N medium, and incu- 
bated for the indicated time periods. Dead cells were stained with 2.5 pg ml! 
phloxine B as described previously”. At least 200 cells were examined for each 
strain, and the proportion of phloxine B-positive cells to total cells observed under 
a bright field was calculated. 
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Extended Data Figure 1 | Interactions of Atg39 and Atg40 with Atg8 and 
Atg11. a, Yeast two-hybrid assay. Cells expressing proteins fused with the Gal4 
activation domain (GAD) or the Gal4 DNA-binding domain (GBD) were 
grown on the agar plate with or without histidine (His). Representative results 
of two technical replicates are shown. Growth without His indicates positive 
interaction. The P52A R67A mutation in Atg8 weakens its binding to the Atg8- 
family-interacting motif (AIM)”°. b, In the experiments described in Fig. 1b, the 
protein band intensities were measured, and the relative ratios of co- 
precipitated proteins to direct targets of immunoprecipitation (coIP (%)) are 
shown as the mean + s.d. (n = 3). WT, wild type. c, Cells were treated with 


rapamycin for 13 h, and the lysates (input) were subjected to 
co-immunoprecipitation analysis using anti-GFP antibody. The 
immunoprecipitates (IP) were analysed by immunoblotting. c, Co- 
immunoprecipitation efficiencies were calculated as described in b, and the 
results are shown as the mean + s.d. (n = 3). d, Cells treated with rapamycin for 
12 h were subjected to co-immunoprecipitation analysis as described in 

c. Eight residues in the Asp52-Ser59 region in Atg39 were replaced with Ala in 
the D52-S59A mutant. Representative results of three technical replicates 

are shown (c, d). *P < 0.05; **P < 0.01; ***P < 0.001 (unpaired two-tailed 
Student’s t-test) (b, c, d). 
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Extended Data Figure 2 | Atg39 and Atg40 are not required for the 
cytoplasm-to-vacuole targeting pathway, mitophagy, pexophagy, and non- 
selective autophagy. a, Cells were incubated in nitrogen starvation medium, 
and the processing of the proform (p) of the vacuolar aminopeptidase Apel 
into the mature form (m), which is indicative of activity of the cytoplasm-to- 
vacuole targeting pathway, was examined by immunoblotting using anti-Apel 
antibodies. mApel intensity was divided by the sum of mApel and pApel 
intensities, and the relative values are shown (Apel maturation (%)). Vacuolar 
degradation of GFP-Atg8, which correlates with activity of non-selective 
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autophagy, was also investigated. The results were quantified (see Methods), 
and the relative values (%) are shown. b, c, Mitophagy and pexophagy were 
examined as described previously*'. Degradation of GFP-fused proteins was 
quantified (see Methods), and the relative values (%) are shown. Representative 
immunoblots of two technical replicates are presented (a-c). d, To assess 
bulk degradation of the cytoplasm by non-selective autophagy, a Pho8A60 
alkaline phosphatase (ALP) assay was performed as described previously”. 
ALP activities are shown as the mean + s.d. (n = 3). AU, arbitrary units; 
NS, not significant; ***P < 0.001 (unpaired two-tailed Student’s t-test). 
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Extended Data Figure 3 | The ER is degraded via selective macroautophagy immunoblots of three (a, b, d, e) or four (c) technical replicates are shown. 
under nitrogen-deprived conditions in a manner dependent on Atg39 and = Sec63-GFP degradation (%) was quantified (see Methods), and the results 
Atg40. a-e, Cells treated with rapamycin or incubated in nitrogen-starvation are shown as the mean + s.d. (n = 3 (b, d, e) or 4 (c)). NS, not significant; 


medium were subjected to immunoblotting analysis. Pep4 and Ypt7 are *P < 0.05; **P < 0.01; ***P < 0.001 (unpaired two-tailed Student’s t-test) 
required for vacuolar lytic activity and autophagosome-vacuole fusion, (b-e). The results in c and d showed that ER-phagy requires Atg]1 and the 
respectively. Atg17 is a subunit of the Atg] kinase complex and required for | AIM-binding ability of Atg8, but not previously characterized autophagic 
starvation-induced autophagy’. Atg19 and Atg34 are autophagy receptors receptors, suggesting the existence of an unknown receptor for this pathway. 
for vacuolar enzymes; Atg32 and Atg36 are those for mitochondria and f, Fluorescence microscopy of cells treated with or without rapamycin. Images 
peroxisomes, respectively. In e, atg39A or atg40A cells harbouring Atg39-HA are representative of at least 150 cells. Arrowheads, Atg8-and Sec63-positive 
or Atg40-HA plasmids, respectively, were analysed. Representative puncta. Scale bars, 5 um. 
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Extended Data Figure 4 | Atg39 and Atg40 function as ER-phagy receptors. 
a, Atg39-mCherry or Atg40-mCherry was expressed under the endogenous 
promoter (endo.) or the CUP1 promoter for overexpression (OE) in the 
presence of 250 uM CuSO,. These cells were treated with rapamycin, and 
degradation of Sec63—GFP was examined by immunoblotting. Pgk1 serves as a 
loading control. Representative immunoblots of three technical replicates are 
presented, and the quantification results (see Methods) are shown as the 
mean + s.d. (n = 3). *P < 0.05; **P < 0.01 (unpaired two-tailed Student’s 
t-test). The results for atg1A cells suggest that the rapamycin-induced decrease 
in Atg39-mCherry overexpressed in wild-type cells is due to its autophagic 
degradation. b, The co-localization of Atg39 or Atg40 with Atg11 in cells 
treated with rapamycin for 3 h was observed under a fluorescence microscope. 
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Images are representative of at least 100 cells. Scale bars, 2 um. c-e, Vacuolar 
degradation of Atg39-GFP and Atg40-GFP was examined by immunoblotting 
(c, e) and fluorescence microscopy (d). atg39A and atg40A cells carrying Atg39- 
GFP and Atg40-GFP plasmids, respectively, were examined in e. Representative 
immunoblots of three (Atg39-GFP degradation in , e), or four (Atg40-GFP 
degradation in ¢, e) technical replicates are presented, and the quantification 
results are shown as the mean + s.d. (n = 3 (Atg39-GFP degradation in c, e) or 4 
(Atg40-GFP degradation in c, e)). ***P < 0.001 (unpaired two-tailed Student’s 
t-test) (c, e). Images are representative of at least 100 cells (d). Scale bars, 5 jum. 
The results in c—e suggest that vacuolar transport of Atg39 and Atg40 requires 
their binding to Atg11 and Atg8. 
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Extended Data Figure 5 | Atg39 and Atg40 are differently involved in degradation of the pnER and the cER/cytoER. Vacuolar transport of Hmg1-GFP and 
Rtn1-GFP was examined by fluorescence microscopy. Images are representative of at least 100 cells. Scale bars, 5 um. 
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Extended Data Figure 6 | Electron microscopy of the ER sequestered into 
autophagic bodies. a, pep4A cells treated with rapamycin for 13 h were 
analysed by immunoelectron microscopy using anti-Kar2 antibodies. Images 
are representative of at least 50 cells. Arrowheads, Kar2 signals (gold particles). 
These images show the staining of the cortical ER (left) and the perinuclear ER/ 
nuclear envelope (right). b-d, Electron microscopy images (without 


immunogold labelling) of cells incubated in nitrogen-starvation medium for 
12h (b, d) or treated with rapamycin for 12 h (c). Images are representative of 
at least 14 cells that contained autophagic bodies. Arrowheads, autophagic 
bodies containing double-ring structures (b, c) or those containing tubules/ 
sheets (d). Scale bars, 200 nm (a-d). 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


perinuclear ER/ 
nuclear envelope 


Atg39-dependent 
pnER-phagy/nucleophagy 


Extended Data Figure 7 | Model for Atg39- and Atg40-mediated ER-phagy 
and nucleophagy. Atg39 localizes to the perinuclear ER (pnER)/nuclear 
envelope and acts to load a part of the pnER/nucleus as double-membrane 
vesicles, which contain intranuclear components, into autophagosomes. Atg40 
mainly localizes to highly curved regions in the cortical ER (cER) and 
cytoplasmic ER (cytoER), and allows sequestration of these ER domains as 


autophagosome 


vacuole 


OAtg39 
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folded tubules or sheets into autophagosomes. Whereas Atg39 is dispensable 
for cER/cytoER-phagy, Atg40 also contributes to pnER-phagy/nucleophagy. 
It is likely that both of these receptors interact with Atg11 to recruit the core 
Atg proteins that mediate autophagosome formation, and also with Atg8 to 
promote autophagic sequestration of the ER or a part of the nucleus. 
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Extended Data Figure 8 | Atg39-dependent degradation of part of the 
nucleus. a, Srcl-GFP degradation assay for cells before rapamycin treatment. 
The results for cells treated with rapamycin are shown in Fig. 4a. b, Vacuolar 
transport of Srcl-GFP was examined by fluorescence microscopy. Images are 
representative of at least 100 cells. Scale bars, 5 um. c, Autophagic degradation 
of Nop1-GFP was examined by immunoblotting. Representative results of two 
(a) or three (c) technical replicates are shown. Nop1-GFP degradation was 


immunoEM: a-Nop1 


nuclear envelope 


quantified (see Methods), and the results are shown as the mean = s.d. (n = 3). 
NS, not significant; ***P < 0.001 (unpaired two-tailed Student’s t-test). 

d, pep4A cells were treated with rapamycin for 13 h, followed by immuno- 
electron microscopy using anti-Nop1 antibody. Images are representative of six 
cells containing Nop] signals in the nucleolus. Scale bars, 500 nm (left) and 200 
nm (right). The data show that the nucleolus is specifically labelled with gold 
particles. Arrowheads, autophagic bodies in the vacuolar lumen. 
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Extended Data Figure 9 | Phenotypes of atg39 and atg40 mutants under 
nitrogen-deprived conditions. a, Cells were examined by electron 
microscopy. Images are representative of at least 14 cells. b, Cells were subjected 
to nitrogen starvation, and dead cells were stained with phloxine B. Images are 
representative of at least 600 cells. DIC, differential interference contrast 
microscopy images. c, Dead cells were counted (see Methods), and the 
results are shown as the mean + s.d. (n = 6). d, Wild-type or atg39A cells 
harbouring Atg39-GFP plasmids or the empty vector were subjected to 
nitrogen starvation, and dead cells were counted. The results are shown as 
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the mean + s.d. (1 = 3). NS, not significant; *P < 0.05; ***P < 0.001 (unpaired 
two-tailed Student’s t-test) (c, d). e, f, Cells expressing Rtn1-GFP (e) or Sec63- 
GFP (f) were subjected to nitrogen starvation, and the fluorescence microscopy 
images were taken by focusing cell periphery (e) or centre (f). Images are 
representative of at least 100 (e) or 200 (f) cells. In f, the fluorescence intensity of 
Sec63-GFP at the periphery of cells marked with asterisks (left) was measured 
along cyan lines in insets and graphically shown (right). Scale bars, 500 nm 
(a), 5 um (b, e, f). 
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Alternative 3’ UTRs act as scaffolds to regulate 
membrane protein localization 


Binyamin D. Berkovits' & Christine Mayr! 


About half of human genes use alternative cleavage and polyade- 
nylation (ApA) to generate messenger RNA transcripts that differ 
in the length of their 3’ untranslated regions (3’ UTRs) while pro- 
ducing the same protein’ *. Here we show in human cell lines that 
alternative 3’ UTRs differentially regulate the localization of mem- 
brane proteins. The long 3’ UTR of CD47 enables efficient cell 
surface expression of CD47 protein, whereas the short 3’ UTR 
primarily localizes CD47 protein to the endoplasmic reticulum. 
CD47 protein localization occurs post-translationally and inde- 
pendently of RNA localization. In our model of 3’ UTR-dependent 
protein localization, the long 3’ UTR of CD47 acts as a scaffold to 
recruit a protein complex containing the RNA-binding protein 
HuR (also known as ELAVL1) and SET“ to the site of translation. 
This facilitates interaction of SET with the newly translated cyto- 
plasmic domains of CD47 and results in subsequent translocation 
of CD47 to the plasma membrane via activated RACI (ref. 5). We 
also show that CD47 protein has different functions depending on 
whether it was generated by the short or long 3’ UTR isoforms. 
Thus, ApA contributes to the functional diversity of the proteome 
without changing the amino acid sequence. 3’ UTR-dependent 
protein localization has the potential to be a widespread trafficking 
mechanism for membrane proteins because HuR binds to thou- 
sands of mRNAs*”, and we show that the long 3’ UTRs of CD44, 
ITGA1 and TNFRSF13C, which are bound by HuR, increase sur- 
face protein expression compared to their corresponding short 
3’ UTRs. We propose that during translation the scaffold function 
of 3’ UTRs facilitates binding of proteins to nascent proteins to 
direct their transport or function—and this role of 3’ UTRs can be 
regulated by ApA. 

Alternative 3’ UTR isoform abundance was shown to be highly cell- 
type-specific and can change upon proliferation, differentiation and 
transformation’. Alternative 3’ UTR isoforms produce the same pro- 
tein, but the long 3’ UTRs contain additional regulatory elements that 
can regulate mRNA localization and protein abundance'*"’. We have 
discovered a new function of 3’ UTRs: they can regulate protein local- 
ization independently of RNA localization. 

CD47 is best known as a ubiquitous cell surface molecule that acts as 
a marker of self and protects cells from phagocytosis by macro- 
phages'’”*. We found CD47 protein expressed on the cell surface, as 
well as intracellularly (Fig. la and Extended Data Fig. 1a—c). The CD47 
gene produces alternative 3’ UTRs as determined by the 3’-end 
sequencing method 3’-seq and confirmed by northern blot analysis 
(Fig. 1b, c)’. Exclusive knockdown of the longer 3’ UTR isoform by 
short hairpin RNAs (shRNAs) decreased CD47 surface expression 
without changing intracellular expression (Fig. 1d and Extended 
Data Fig. 1d—g). This suggests that the long 3’ UTR isoform facilitates 
cell surface localization of CD47 protein. 

To test this hypothesis, we asked whether green fluorescent protein 
(GFP) encoded by an mRNA containing the long (with a mutated 
proximal polyadenylation signal; Extended Data Fig. 1h) or the short 
3' UTR of CD47 would localize differently. To allow GFP to enter the 
secretory pathway, we replaced the extracellular domain (ECD) of 


CD47 with GFP, while preserving the CD47 signal peptide, transmem- 
brane domains (TMDs) and carboxy terminus, which we refer to as 
GFP-TM (Fig. le). We observed that GFP-TM encoded by an mRNA 
containing the long 3’ UTR of CD47 (GFP-TM-LU) localizes prim- 
arily to the cell surface whereas GFP-TM encoded by an mRNA with 
the short 3’ UTR of CD47 (GFP-TM-SU) localizes predominantly 
to the endoplasmic reticulum (Fig. 1f and Extended Data Fig. 1i). 
The localization results were confirmed by fluorescence-activated cell 
sorting (FACS) analysis, using an anti-GFP antibody on permeabilized 
and non-permeabilized cells to measure total and surface GFP levels, 
respectively (Fig. 1g). The localization step occurs at the protein level, 
as both the LU- and SU-containing GFP transcripts show a similar 
distribution near the perinuclear endoplasmic reticulum (Fig. 1h). 
Thus the LU isoform of CD47 encodes information that is necessary 
for cell surface expression of GFP-TM protein, in a manner independ- 
ent of RNA localization. 

To address the mechanism of 3’ UTR-dependent protein local- 
ization (UDPL; Fig. 2a), we reasoned that there must be an RNA- 
binding protein (RBP) that binds to the long, but not the short, 
3' UTR of CD47. The long 3’ UTR of CD47 contains many uridine- 
rich elements (see later), which are potentially bound by HuR®°. HuR 
is known for its role in mRNA stabilization and translation activa- 
tion”'*"*, However, HuR knockdown by shRNAs did not affect CD47 
mRNA abundance or isoform levels, nor did it affect total CD47 pro- 
tein levels (Fig. 2b, bottom, and Extended Data Figs 1d, 2a-c). But, 
strikingly, knockdown of HuR reduced CD47 surface expression 
(Fig. 2b, top, and Extended Data Fig. 2c). This suggests that for 
CD47, HuR mediates protein localization post-translationally. 

Beyond the role of HuR as an RBP®”°, HuR interacts through pro- 
tein-protein interactions with SET, ANP32A and ANP32B*. Nuclear 
SET binds to histone tails and prevents acetylation’’, but phosphory- 
lated SET localizes to the cytoplasm and the surface of the endoplasmic 
reticulum®*’*. Also, SET interacts with RACI, and active RACI trans- 
locates SET to the plasma membrane’. In our model of UDPL (Fig. 2a), 
HuR binds to the long 3’ UTR of CD47 and recruits SET. Upon tar- 
geting of the mRNA to the endoplasmic reticulum surface, the scaffold 
function of the 3’ UTR results in local recruitment of SET to the site of 
translation. After translation of CD47 mRNA, the ECD is located in the 
endoplasmic reticulum lumen, whereas its C terminus is cytoplasmic. 
This allows SET to interact with the newly translated C terminus and 
cytoplasmic domains of CD47 and to translocate CD47 to the plasma 
membrane via active RAC1. Transfer of SET from CD47 mRNA to 
CD47 protein probably requires energy input, as has been shown for 
transfer of the signal peptide from the signal recognition particle to the 
translocation channel’’. In this model of UDPL, surface expression of 
CD47-LU depends on SET and active RAC1. And indeed, knockdown 
of SET or RAC] by shRNAs reduced surface expression of CD47 with- 
out affecting overall CD47 levels (Fig. 2b and Extended Data Fig. 2c-e). 

To determine if UDPL is a more widespread phenomenon, we 
examined the localization of four additional transmembrane proteins 
that are derived from mRNAs with 3’ UTR isoforms that can be bound 
by HuR®® (Extended Data Figs 3 and 4). TSPAN13 has only one 
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3’ UTR isoform, whereas the other three genes (CD44, ITGA1 and 
TNFRSF13C (encoding the BAFF receptor, BAFFR) use ApA to gen- 
erate alternative 3’ UTR isoforms (Fig. 2c and Extended Data Fig. 2f)°. 
As was the case for CD47, knockdown of HuR decreased surface 
expression of all four proteins without changing total protein levels 
(Fig. 2c and Extended Data Fig. 2a, f). As was done for CD47, we 
generated GFP-fused LU and SU constructs for CD44, ITGA1 and 
TNFRSF13C with their respective TMDs, C termini and 3’ UTRs. 
For all tested genes, the longer 3’ UTR increased surface localization 
of GFP-TM (Figs 1f-h, 2d and Extended Data Fig. 2g). This demon- 
strates that UDPL has the potential to be a widespread phenomenon. 

The 3’ UTR of CD47 contains over 30 putative HuR-binding sites 
(Extended Data Fig. 3). We tested whether a 3’ UTR with a few HuR- 
binding sites (HuR-BS) (Extended Data Fig. 3) is enough to mediate 
surface localization. Indeed, the uridine-rich sequence was necessary 
and sufficient for surface localization of GFP-TM, although it was less 
potent than the full-length 3’ UTR of CD47 (Fig. 2e). 

Next, we investigated each step of our UDPL model in more detail. 
We demonstrated by RNA-immunoprecipitation that HuR binds to 
the HuR-BS and to the LU but not to the SU isoform of CD47 (Fig. 3a, 
left). SET also associates with the long 3’ UTR of CD47, which is 
dependent on HuR (Fig. 3a, right). SET or HuR overexpression was 
insufficient to localize GFP-TM-SU to the cell surface (Extended Data 
Fig. 5a, b). However, tethering of SET or HuR to the short 3’ UTR 
isoform of CD47 was sufficient to redirect GFP-TM localization from 
the endoplasmic reticulum to the plasma membrane (Fig. 3b and 
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Extended Data Fig. 5c, d; see Extended Data Fig. 5 for experimental 
details). This indicates that local recruitment of SET to the site of 
translation, mediated by the scaffold function of the long 3’ UTR, is 
required for UDPL. 

Furthermore, by co-immunoprecipitation we demonstrated that 
endogenous SET only interacts with CD47-LU protein, but not with 
CD47-SU protein (Fig. 3c). Since SET binds to lysine residues’*, we 
mutated single lysines in the C terminus of CD47, which decreased 
GFP-TM surface localization by up to 37% (Extended Data Fig. 6a). 
Mutation of 2/5 lysines decreased it by more than 50% and deletion of 
the entire C terminus decreased it by 80% (GFP-TM-LUAGC; Fig. 3d 
and Extended Data Fig. 6b). Additional mutation of three lysines in the 
first cytoplasmic loop abolished surface GFP expression (GFP-TM- 
LUACL; Fig. 3d and Extended Data Fig. 6b). The reduction in surface 
expression is probably due to partial or complete loss of SET binding to 
CD47, as demonstrated by co-immunoprecipitation (Fig. 3e). 

To test if the difference in surface localization has phenotypic con- 
sequences, we added the ECD of CD47 to the GFP constructs (called 
CD47-LU or CD47-SU; Fig. 4a). Both constructs resulted in compar- 
able overall CD47 protein levels (Extended Data Fig. 7a). CD47-LU 
efficiently localized to the cell surface via UDPL mediated by active 
RAC] (Fig. 4b, c and Extended Data Fig. 7b). Whereas GFP expressed 
from the GFP-TM-SU construct nearly completely localized to the 
endoplasmic reticulum (Fig. 1f-h), CD47-SU primarily localizes to 
the endoplasmic reticulum, but also localizes partially to the cell sur- 
face, but independently of active RAC1 (Fig. 4b, c). 
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Figure 1 | The long 3’ UTR of CD47 localizes GFP-TM protein to the 
plasma membrane, whereas the short 3’ UTR localizes it to the endoplasmic 
reticulum. a, Fluorescence confocal microscopy of endogenous CD47 protein 
in non-permeabilized (top) and permeabilized (bottom) cells. IC, intracellular. 
b, 3’-seq analysis of naive B cells shows two 3’ UTR isoforms of CD47 mRNA 
(short 3’ UTR (SU) and long 3’ UTR (LU)). Shown is the last exon of the gene 
model. Isoform abundance shown in transcripts per million (TPM). 

c, Northern blot analysis of human cell lines confirming CD47 mRNA isoforms 
from b. The corresponding ethidium-bromide-stained RNA gel is shown as 
loading control. d, Staining of U2OS cells as in a after transfection of a control 
shRNA (sh Co) or an shRNA against the long CD47 3’ UTR isoform. e, CD47 
protein contains an N-terminal signal peptide (S; green), ECD (blue), five 
TMDs (grey) and a cytoplasmic C terminus (C; red). In both constructs, the 
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ECD was replaced with GFP and either fused with the long (GFP-TM-LU) or 
the short CD47 3’ UTR (GFP-TM-SU). Constructs are drawn to scale. 

f, Fluorescence confocal microscopy of fixed U2OS cells after transfection of 
GFP-TM-LU or GFP-TM-SU. Bottom, with additional staining of the 
endoplasmic reticulum with anti-calnexin. g, FACS analysis of GFP expression 
in transfected U2OS cells with (dark blue lines, detection of total expression) 
and without permeabilization (light blue lines, detection of surface (surf.) 
expression). Values for mean fluorescence intensity (MFI) are shown in 
parentheses. Unstained cells are shown in grey. Representative image from 
more than 20 experiments. h, RNA-fluorescence in situ hybridization (FISH) 
(red) against GFP in permeabilized U2OS cells after transfection of GFP-TM- 
LU or GFP-TM-SU. Bottom panel also shows GPF-TM protein. a, d, fandh are 
representative images from hundreds of cells. Scale bars, 10 [um. 
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Figure 2 | 3’ UTR-dependent protein localization (UDPL) depends on HuR, 
SET and RACI, and mediates surface localization of membrane proteins. 
a, Model of UDPL. HuR binds to the long 3’ UTR and recruits SET. During 
translation of CD47 mRNA, this protein complex is targeted to the endoplasmic 
reticulum (ER) surface where SET binds to the newly translated cytoplasmic 
domains of CD47. This step probably requires energy. SET interacts with RAC1 
and active RAC] translocates SET and CD47 to the plasma membrane. b, FACS 
analysis of endogenous CD47 protein expression in HEK293 cells. Left panel is 
after transfection of control shRNA (sh Co) or shRNAs against HuR (shown are 
all GFP™ cells). Middle and right panels depict cells stably expressing the 
indicated shRNAs. Surface CD47 (top) and total CD47 protein (bottom) were 
measured. c, 3'-seq analysis for CD44 in HEK293 cells and for TNFRSF13C in 


We added the respective ECDs to CD44 and BAFFR, which also 
increased surface expression of their SU isoforms compared with their 
GFP-TM isoforms, but to a lesser extent than was observed for CD47 
(Extended Data Fig. 7c, d). Di- or multimerization of cell surface 
receptor subunits, which often occurs through their ECDs, is a com- 
mon strategy for overcoming endoplasmic reticulum retention, 
because it results in masking of endoplasmic reticulum retention sig- 
nals’’. We speculate that CD47-SU, CD44-SU and BAFFR-SU might 
use such a mechanism (although the multimerization partners are 
unknown) for their partial surface expression. In the case of BAFFR 
the ECD only increased surface expression by 1.2-fold, indicating that 
BAFFR strongly depends on UDPL for surface expression (Extended 
Data Fig. 7d). This is supported by the absence of BAFFR on B cells in 
Rac-deficient mice”. Taken together, our data suggest that membrane 
proteins rely on UDPL for surface expression to varying degrees. 

Cells with high CD47 surface levels are protected from phagocytosis 
by macrophages (Extended Data Fig. 7e)'*. To examine if the differ- 
ence in surface expression of CD47-LU and CD47-SU protects cells to 
a different extent from phagocytosis, we used CD47-deficient Jurkat 
cells (called JinB8 cells’) and expressed similar total amounts of 
CD47-LU or CD47-SU (Extended Data Fig. 7a). Co-culture of these 
cells with macrophages demonstrated that CD47-LU fully protected 
the cells, whereas CD47-SU only partially protected the cells from 
phagocytosis (Fig. 4d). 

CD47 also functions in the regulation of apoptosis”, as JinB8 cells or 
tissues from Cd47-knockout mice” fail to undergo apoptosis after 


10? 108) = 104 10° 10' = 102 S108 = 104 


10° 10? 


102, 108 = 104 


B-LCL cells, as shown in Fig. 1b. FACS analysis of endogenous CD44 protein in 
HEK293 cells (left) and endogenous BAFFR protein in SHSY-5Y cells (right) 
shown as in b (left). d, Left, FACS analysis of GFP after transfection of 
constructs containing a signal peptide and GFP fused to the TMD and C 
terminus of CD44 and either the long 3’ UTR (dark blue line) or the short 

3’ UTR (light blue line) in U251 cells. Right, as in left panel, but for BAFFR with 
transfection into HeLa cells. e, FACS analysis of GFP expression shows that 
HuR-BS is sufficient for surface localization of GFP-TM (dark blue line). 
Deletion of the binding sites from the HuR-BS construct (HuR-BSA) abrogates 
GFP surface expression (light blue line). For b, c, d and e, surface and total 
protein expression were determined and shown as in Fig. 1g. Representative 
images from three (sh2 HuR, n = 5) biological replicates. 


y-irradiation”*”*. Interestingly, expression of CD47-SU in JinB8 cells 
restored apoptosis, but expression of CD47-LU did not affect the loss- 
of-apoptosis phenotype (Fig. 4e and Extended Data Fig. 7f, g). Thus, 
when a cell requires increased CD47 surface expression, transcrip- 
tional upregulation alone would be non-optimal as it would confer 
increased susceptibility to apoptosis. ApA-generated 3’ UTR isoforms 
allow independent regulation of differentially localized and function- 
ally distinct CD47 protein. 

As the surface localization of CD47-SU is RAC1-independent, it 
also does not co-localize with RAC1 at the plasma membrane 
(Fig. 4f). In contrast, CD47-LU shows strong co-localization with 
RAC] at lamellipodia (Fig. 4f). Both activated RAC1 and CD47 are 
necessary for efficient cell migration and activated RACI localizes to 
the leading edge of migrating cells***’°. We show that only the express- 
ion of CD47-LU resulted in changes in cell morphology with the 
generation of lamellipodia at the leading edge of cells. Furthermore, 
CD47-LU, but not CD47-SU, resulted in increased active RAC1 
(Fig. 4g), which suggests that CD47-LU may cooperate with RAC1 
during cell migration. Thus, CD47 protein localized to the same cel- 
lular compartment, but produced by the SU or LU mRNA isoforms, 
can exert different functions. It is currently not known if other surface 
proteins derived from their LU or SU isoforms also have distinct 
biological roles. 

We propose that UDPL is a widespread mechanism for surface 
expression of membrane proteins. All currently known components 
of the pathway (HuR, SET and RAC1) are ubiquitously and highly 
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Figure 3 | Mechanism of UDPL. a, Left, RNA-immunoprecipitation after 
transfection of the indicated constructs into HEK293 cells. Protein-RNA 
complexes were pulled-down with anti-HuR antibody and GFP abundance was 
normalized to GAPDH and is shown as fraction of input. Right, RNA co- 
immunoprecipitation after transfection of the indicated shRNAs. Protein- 
RNA complexes were pulled down with anti-SET antibody and the abundance 
of CD47-LU was normalized to GAPDH and is shown as fraction of input. 
Shown is mean + standard deviation (s.d.), n = 3 biological replicates. 

*EP < 0.0003, **P < 0.002, NS, not significant (P > 0.05), two-sided t-test for 
independent samples. b, MS2-binding sites (see Extended Data Fig. 5) were 
added to GFP-TM-SU and co-transfected with MS2-mC (left), MS2-mC-HuR 
(middle) or MS2-mC-SET (right). mC, mCherry. Fluorescence confocal 
microscopy of HEK293 cells after transfection of indicated constructs shows 


that recruitment of HuR or SET to the short 3’ UTR redirects localization of 
GFP protein from the endoplasmic reticulum to the cell surface. Representative 
images from hundreds of cells. Scale bars, 10 um. c, Co-immunoprecipitation of 
endogenous SET using anti-GFP antibody after transfection of CD47-SU or 
CD47-LU in HEK293 cells (for constructs, see Fig. 4a). Two per cent of input 
was loaded. WB, western blot. d, FACS analysis of surface GFP expression after 
transfection of GFP-TM-LU (dark blue line), GFP-TM-LU with a C-terminal 
deletion (AC; light blue line; left) or with destruction of both SET-binding sites 
(AC and K163A, K166A, K175A; ACL; light blue line, right). Shown as in 
Fig. 1g. Representative image from more than four experiments. e, Co- 
immunoprecipitation of endogenous SET using anti-GFP antibody after 
transfection of the indicated constructs. Two per cent of input was loaded. 
Asterisk indicates unspecific band. 
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Figure 4 | CD47 protein has different functions depending on whether 
it was generated by the SU or LU isoform. a, To generate GFP-CD47, 
GFP was inserted in frame between the signal peptide and the rest of the 
CD47 open reading frame. GFP-CD47 was fused with either the long or short 
CD47 3' UTR, called CD47-LU and CD47-SU, respectively. b, FACS analysis 
of surface (surf.; light blue) and total (dark blue) GFP-CD47 expression in 
transfected JinB8 cells. Shown as in Fig. 1g. Representative images from four 
experiments. c, FACS analysis of GFP expression after transfection of CD47- 
LU or CD47-SU with or without co-transfection of dominant-negative RAC1 
(N17RAC1). Shown as in Fig. 1g. Representative images from n = 7 (LU) and 
n= 2 (SU) experiments. d, Fraction of Mitomycin-C-treated cells that 
survived at day (d)3 after co-culture with macrophages is displayed for Jurkat, 
JinB8 (CD47~’~) and the GFP* JinB8 cells after nucleofection of CD47-LU 


366 | NATURE | VOL 522 | 18 JUNE 2015 


or CD47-SU. Shown is mean + s.d., n = 3 biological replicates. **P < 0.005, 
*P < 0.02, NS, not significant (P > 0.05), two-sided t-test for independent 
samples. e, The fraction of surviving cells (TO-PRO3 negative) measured by 
FACS analysis at day 3 after y-irradiation is shown for the same populations 
as in d. Shown is mean + s.d., n = 3 biological replicates of the 20% of cells 
with the highest GFP expression. Gy, Gray. f, Fluorescence confocal 
microscopy of permeabilized U251 cells after transfection of CD47-LU or 
CD47-SU co-stained with anti-RAC1 antibody. Yellow indicates co- 
localization. Representative images from hundreds of cells. Scale bars, 

10 um. g, Immunoprecipitation of endogenous RAC1-GTP (active RAC1) in 
HEK293 cells after transfection of CD47-LU, CD47-SU, or empty vector. 
Total RAC1 and GFP—CD47 were measured from input. n = 3 biological 
replicates. 
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expressed (Extended Data Fig. 8a)’. UDPL requires the presence of 
HuR-binding sites in the 3’ UTR and SET-binding sites in the cyto- 
plasmic domains of membrane proteins (Extended Data Figs 3, 4 and 
9). All candidates tested so far that met both requirements used UDPL 
for surface expression (Figs 1f-h, 2d, 4b and Extended Data Figs 2g, 
7c). HuR-binding sites are highly abundant as HuR binds to thousands 
of mRNAs®*’, with a third of them being membrane proteins 
(Extended Data Fig. 8b). Although the SET-binding motif is currently 
unknown, we (Fig. 3d, e and Extended Data Fig. 6) and others have 
shown that SET binds to positively charged amino acids in histone 
tails or cytoplasmic domains of membrane proteins'*”’. According to 
the positive-inside rule for integral membrane proteins, the cyto- 
plasmic domains of membrane proteins are enriched in positively 
charged amino acids for topological reasons”. Therefore, potential 
SET-binding sites in cytoplasmic domains of membrane proteins are 
very widespread. 

So far, efforts to determine the consequences of alternative 3’ UTRs 
have largely focused on mRNA stability and translation’*””. We 
expand the known functions of 3’ UTRs and show that they can act 
as scaffolds for RBPs that serve as adaptors to recruit effector proteins 
to the site of translation, which determines subcellular protein local- 
ization and function. With respect to CD47, CD44, ITGA1, TSPAN13 
and BAFFR, the adaptor protein is HuR and the effector protein is SET. 
We further speculate that the scaffold function of 3’ UTRs may extend 
beyond the regulation of membrane proteins. RBPs could recruit other 
effector proteins, for example, enzymes that post-translationally mod- 
ify proteins, as was shown for long non-coding RNAs”. We showed 
here that CD47 produced by alternative 3’ UTR isoforms localizes to 
different cellular compartments and has independent and sometimes 
opposite functions with respect to cell survival and cell migration. 
Thus, through the generation of alternative 3’ UTR isoforms, ApA 
contributes to functional diversity of the proteome without changing 
the amino acid sequence. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Cell lines. MCEF7 (breast cancer), HeLa (cervical cancer), HEK293 (embryonic 
kidney), Caov-3 (ovarian carcinoma), NTERA2 (embryonic carcinoma) and 
THP-1 cells (monocytic leukaemia) were purchased from ATCC. B-LCL are 
Epstein Barr virus (EBV)-immortalized human B cells described earlier’. U2OS 
cells (sarcoma) were a gift from T. Brummelkamp, Toledo (B-cell lymphoma) cells 
were a gift from M. Mueschen, U251 (glioblastoma) cells were a gift from I. 
Mellinghoff, SHSY-5Y (neuroblastoma) cells were a gift from T. Tuschl and 
Jurkat (T-cell leukaemia) and JinB8 (CD47-negative Jurkat) cells were a gift from 
W. Frazier. 

Constructs. For some of the shRNA knockdown experiments, pSUPERretropuro 
was modified by cloning IRES::GFP (derived from pMSCVpig)’ downstream of 
puromycin to obtain pSUPERretropuro containing enhanced (e)GFP (shRNA- 
GFP). The following DNA oligonucleotides served as shRNA precursors and were 
cloned into pSUPER-GFP or pSUPER. CD47-LU2F: 5'-GATCTCCCAGCTGT 
GTTACCGTTAAATTCAAGAGATTTAACGGTAACACAGCTGTTTTTC-3’; 
CD47-LU3F: 5’-GATCCCCCAGCTGTGTTACCGTTAAATTCAAGAGATT 
TAACGGTAACACAGCTGTTTTTC-3’; HuR2F: 5’-GATCTCCGATCAGA 
CTACAGGTTTGTTTCAAGAGAACAAACCTGTAGTCTGATCTTTTTC-3’; 
HuR3F: 5'- GATCCCCGAGGCAATTACCAGTTTCATTCAAGAGATGAAA 
CTGGTAATTGCCTCTTTTTC-3'; HuR4F: 5’-GATCCCCTCTTAAGTTTCG 
TAAGTTATTCAAGAGATAACTTACGAAACTTAAGATTTTTC-3’; SETIF: 
5'-GATCCCCTGAAATAGACAGACTTAATTTCAAGAGAATTAAGTCT 
GTCTATTTCATTTTTC-3’; SET2F: 5’- GATCTCCCTGGTTTACTGAC 
CATTCTTTCAAGAGAAGAATGGTCAGTAAACCAGTTTTTC-3'; RAC2F: 
5'-GATCCCCCTTGCCTACTGATCAGTTATTCAAGAGATAACTGATCA 
GTAGGCAAGTTTTTC-3'; RAC3F: 5’-GATCCCCGTCCCTTGGAACCT 
TTGTATTCAAGAGATACAAAGGTTCCAAGGGACTTTTTC-3’; ControlF: 
5'-GATCTCCTTCTCCGAACGTGTCACGTTTCAAGAGAACGTGACACG 
TTCGGAGAATTTTTC-3’. 

The sequence of shRNA1 SET and of shRNA Control were published earlier?” 
The GFP fusion constructs were generated in pcDNA3.1 expression vector (Life 
Technologies). The short and long 3’ UTRs of CD47 were PCR-amplified from 
genomic DNA using Q5 High Fidelity DNA polymerase (NEB) and the primers 
listed later and inserted between the NotI and XbalI sites. To obtain expression of 
only the long 3’ UTR isoform of CD47, the proximal polyadenylation site was 
mutated from AAUAAA to ACUCAA using the QuikChange Multi Site Directed 
Mutagenesis Kit (Agilent). The resulting plasmids were used to test qPCR primers 
for accuracy in measuring short to long 3’ UTR isoform ratios (see later). The short 
3' UTR of CD47 used in the MS2-BS construct was cloned from the plasmid 
containing the long 3’ UTR with the mutated proximal polyadenylation site. 
eGFP was PCR-amplified from pMSCV-pig and inserted upstream of each 
CD47 3' UTR (BamHI, NotI). The signal peptide of CD47 was generated by 
annealing two DNA oligonucleotides that were inserted into the BamHI site. To 
generate the GFP-TM constructs, the sequence of the TMDs and C-terminal tail of 
CD47 (the longest isoform, isoform 4; ref. 33) were cloned from Toledo cDNA and 
inserted downstream of eGFP (Bsrgl, NotI). Isoform 4 was chosen as it is the most 
abundant isoform in Jurkat cells (data not shown). The two nucleotides of eGFP 
that occur after the Bsrgl site were included in the forward primer. To generate the 
GFP-CD47 constructs, the ECD, TMDs and C terminus of CD47 were PCR- 
amplified from Toledo cDNA using the TM reverse primer and the CDS forward 
primer and inserted downstream of eGFP (Bsrgl, NotI). The sequence of HuR-BS 
and HuR-BSA are shown in Extended Data Fig. 3 and replaced the long 3’ UTRin 
GFP-TM-LU. The GFP constructs containing the TMDs and C termini fused to 
either the long or short 3’ UTRs of CD44, ITGA1 and TNFRSF13C were generated 
as follows. TMDs, C termini and short 3’ UTRs of CD44, ITGA1 and TNFRSF13C 
were cloned from Toledo, SHSY-5Y and B-LCL cDNA respectively and inserted 
downstream of eGFP (Bsrgl, XbaI). Short 3’ UTRs consisted of the first 122 
nucleotides of CD44, the first 45 nucleotides of [TGA] and the first 337 nucleotides 
of TNFRSF13C. Long 3’ UTRs: 3,068 nucleotides after the stop codon of CD44 and 
3,996 nucleotides after the stop codon of ITGA1 were used. For TNFRSF13C, 1,221 
nucleotides after the stop codon together with the last 600 nucleotides of the 
3’ UTR (nucleotides 2,712-3,311 after the stop codon containing the majority 
of HuR-binding sites) were used as long 3’ UTR and were cloned from genomic 
DNA. To generate BAFFR-SU and -LU, the complete open reading frame of 
TNFRSF13C (without the start codon) was amplified from human B-cell cDNA 
and cloned using Gibson Assembly Cloning (NEB) into pcDNA3.1 vector used 
above downstream of eGFP. To generate CD44-SU and -LU, the open reading 
frame of CD44 (without the start codon) was amplified from cDNA of the human 
breast cancer cell line MDA-MB231 and cloned using Gibson Assembly Cloning 
(NEB) into pcDNA3.1 vector used above downstream of eGFP. To generate the 
GFP-TM-LUAC construct the sequence of just the TMDs of CD47 was cloned 
from Toledo cDNA using the TM forward primer and TMAC reverse primer and 


inserted downstream of eGFP (Bsrgl, NotI). The 24 MS2-binding sites were cloned 
from a plasmid obtained from J. Gerst™ using the primers listed later (Xbal, Apal). 
The constructs in which K163, K166, K175, K290, K297 and K304 were mutated to 
alanines were generated using the QuikChange Multi Site Directed Mutagenesis 
Kit (Agilent). 

CD47UTRE: 5’-ATGCGCGGCCGCAGTGAAGTGATGGACTCCGATT-3’; 
CD47shortUTRR: 5'-ATGCTCTAGATGGGCAAACAACATAGATCA-3’; 
CD47longUTRR: 5'-ATGCTCTAGAAACACATTGGACTGATTTAAAACTT-3’; 
GFPF: 5’-ATGCGGATCCATGGTGAGCAAGGGCGA-3’; GFPR: 5’-ATGCG 
CGGCCGCTTACTTGTACAGCTCGTCCATG-3’; SPCD47F: 5'-GATCCAT 
GTGGCCCCTGGTAGCGGCGCTGTTGCTGGGCTCGGCGTGCTGCGGAT 
CAGCTG-3'; SPCD47R: 5'-GATCCAGCTGATCCGCAGCACGCCGAGCC 
CAGCAACAGCGCCGCTACCAGGGGCCACATG-3'; CD47TMF: 5’-ATGC 
TGTACAAGATTCTTATTGTTATTTTCCCAATT-3'; CD47TMR: 5'-ATGC 
GCGGCCGCTTATTCATCATTCATCATTCCTT-3’; CD47CDSF: 5’-ATGCT 
GTACAAGCAGCTACTATTTAATAAAACAA-3'; TMACR: 5'-ATGCGCGG 
CCGCTTATTTCATATAAACTAGTCCAAGTAA-3’; MS2-BSF: 5'-ATGCTC 
TAGAGGGCCCTATATATCGATCCTAAG-3'; MS2-BSR: 5’-ATGCGGGCC 
CTTTATTATGCTTGGTACCGAGCTCG-3’. 

To create the MS2 fusion constructs”, the pUG34-MS2-GEFP-SBP plasmid was 
obtained from J. Gerst**. SBP was replaced by either HuR, SET or a stop codon. 
HuR and SET were PCR-amplified from U2OS cDNA using the primers listed 
below (Bsrgl, XbaI). The stop codon was generated by annealing two DNA oligo- 
nucleotides (Bsrgl, XbaI). mCherry was PCR-amplified and replaced GFP 
(BamHI, Bsrgl). After cloning was complete, all plasmids were sequenced to assure 
fidelity of the sequences. 

HuRF: 5'-ATGCCGTACGAGTCTAATGGTTATGAAGACCACA-3’; HuRR: 
5'-ATGCTCTAGATTATTTGTGGGACTTGTTGGT-3’; SETF: 5’-ATGCTGT 
ACAAGTCGGCGCCGGCGGCCAAA-3’; SETR: 5'-ATGCTCTAGATTAGT 
CATCTTCTCCTTCATCCTC-3’; StopF: 5’-GTACAAGTAATAATAAT-3’; 
StopR: 5’-CTAGATTATTATTACTT-3’; mCherryF: 5’-ATGCGGATCCGTG 
AGCAAGGGCGAGGAG-3’; mCherryR: 5'-TTACTTGTACAGCTCGTCCA 
TGC-3'. 

The N17RACI construct was provided by A. Hall’*. 

Transfections. For transfections into U2OS, HEK293, HeLa and U251 cells 
Lipofectamine 2000 (Invitrogen) and for SHSY-5Y cells Xtreme reagent (Roche) 
was used. JinB8 and Jurkat cells were transfected using the Neon Transfection 
System (Invitrogen) according to the manufacturer’s protocol for transfecting 
Jurkat cells. To account for differences in the sizes of transfected plasmids the 
same molar amounts were transfected. When RNA was to be extracted, the cells 
were grown in the presence of high amounts of puromycin (4 mg ml) for 3 days 
and FACS analysis was performed to ensure that >90% of the cells were GFP. 
Generation of cell lines with stable expression of shRNAs. Stable cell lines were 
generated as described previously”. 

FACS analysis. For surface FACS (in order to detect surface protein expression), 
cells were incubated with mouse anti-CD47-PerCy5.5 (BD Biosciences, 561261), 
mouse anti-CD44-PE (BD Biosciences, 561858), chicken anti-GFP (Abcam, 
ab13970), mouse anti-ITGA1-PE (BD Biosciences, 555749), mouse anti- 
BAFFR-PE (BD Biosciences, 554680) or rabbit anti-TSPAN13 (Genetex, 
GTX52155) in FACS buffer A (0.5% FBS in PBS) for 30 min at 4°C, and then 
washed twice in FACS Buffer A. For detection of GFP and TSPAN13, cells were 
then incubated with goat anti-chicken Alexa Fluor 568 or 633 (Invitrogen, A11041 
or A21103) and goat anti-rabbit Alexa Fluor 680 (Invitrogen, A-21076), respect- 
ively, for 30 min at 4 °C, and then washed twice in FACS Buffer A. At least 30,000 
cells were analysed on a BD FACSCalibur cell analyser (BD Biosciences) and FACS 
data were computed using the FlowJo VX software. 

For intracellular FACS (in order to detect total protein expression), cells were 
fixed for 15 min at room temperature in fixation buffer (4% PFA, 0.02% sodium 
azide, and 0.1% Tween 20 in PBS), washed in FACS buffer B (0.02% sodium azide, 
0.1% Tween 20 in PBS), permeabilized for 10 min at 4°C in permeabilization 
buffer (0.02% sodium azide, 0.1% Tween 20 and 10% dimethyl sulfoxide in 
PBS), washed, re-fixed for 5min at room temperature in fixation buffer, and 
washed again. Cells were incubated with the same primary and secondary anti- 
bodies as for surface FACS in FACS buffer B for 30 min at 4 °C, and then washed 
twice in FACS Buffer B. For live/dead analysis by FACS, cells were incubated with 
TO-PRO3 in FACS buffer A (0.5% FBS in PBS) for 10min at 4°C, and then 
washed twice in FACS Buffer A. Cells were analysed is the same manner as for 
surface FACS. 

For all the GFP-expressing plasmids the 20% of cells with the highest GFP 
expression are shown. 

Immunocytochemistry. For surface staining of CD47, cells were fixed for 15 min 
at room temperature in fixation buffer A (4% PFA and 0.02% sodium azide in 
PBS), washed with PBS, blocked for 15min at 4°C in 5% Normal Goat Serum 
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(Invitrogen, PCN5000) in PBS and then incubated with mouse anti-human CD47 
(Santa Cruz, sc-59079) primary antibody for 1h at 4°C in PBS. After washing in 
PBS, donkey anti-mouse Alexa Fluor 594 (Invitrogen, A-21203) secondary anti- 
body was incubated for 1 h at 4 °C in PBS, and during the last 10 min of incubation 
4',6-diamidino-2-phenylindole (DAPI; Invitrogen, D1306) was added, followed 
by three washes of PBS. Mowiol mounting media (Sloan Kettering Institute, 
Molecular Cytology Core Facility) was used to mount the slides. Imaging was 
performed at the Sloan Kettering Institute Molecular Cytology Core Facility, on 
a Leica TCS SP5 confocal microscope, using a X63, 1.4 numerical aperture oil 
objective. 

For intracellular staining of CD47, cells were fixed for 15 min at room temper- 
ature in fixation buffer B (4% PFA, 0.02% sodium azide, and 0.1% Tween 20 in 
PBS), washed in wash buffer (0.02% sodium azide, 0.1% Tween 20 in PBS), 
permeabilized for 10 min at 4°C in permeabilization buffer (0.02% sodium azide, 
0.1% Tween 20 and 10% dimethy1 sulfoxide in PBS), washed, re-fixed for 5 min at 
room temperature in fixation buffer B, washed and blocked for 15 min at 4°C in 
5% Normal Goat Serum in wash buffer. Mouse anti-CD47 (Santa Cruz, sc-59079) 
primary antibody was incubated overnight at 4°C in wash buffer. The strong 
permeabilization and overnight staining were necessary to visualize intracellular 
CD47, as the CD47 antibody recognizes an epitope in the ECD of CD47, which is 
located in the lumen of the endoplasmic reticulum. Owing to the extended treat- 
ment with a buffer containing Tween 20 the plasma membrane could no longer be 
visualized in these cells. 

For co-staining of GFP with calnexin or RAC1, the cells were fixed for 15 min at 
room temperature in fixation buffer B (4% PFA, 0.02% sodium azide, and 0.1% 
Tween 20 in PBS), washed with wash buffer B, blocked for 15 min at 4°C in 5% 
Normal Goat Serum (Invitrogen, PCN5000) in wash buffer and then incubated 
with rabbit anti-calnexin (Santa Cruz, sc-11397) or mouse anti-RAC1 (Abcam, 
ab12048) primary antibodies for 1h at 4°C in wash buffer. After washing, goat 
anti-rabbit Alexa Fluor 680 (Invitrogen, A-21076) or donkey anti-mouse Alexa 
Fluor 594 (Invitrogen, A-21203) secondary antibodies were incubated for 1h at 
4°C in wash buffer, and during the last 10 min of incubation DAPI (Invitrogen, 
D1306) was added, followed by three washes. Mounting and imaging was per- 
formed as for the surface staining. Owing to the lack of permeabilization and short 
period in Tween 20, the plasma membrane was still visible. When calnexin was co- 
stained with endogenous CD47, the protocol for intracellular staining of CD47 was 
used and the plasma membrane was again not visible. 

Alexa Fluor 594 and 680 were pseudo-coloured red and endogenous GFP and 
mCherry were imaged as they appear, without any antibody. 

RNA-FISH. Custom Stellaris FISH Probes (Biosearch Technologies) were 
designed for the open reading frame of eGFP using the Stellaris Probe designer 
website, and with the assistance of Biosearch Technologies staff. The probes were 
conjugated to the Quasar 670 fluorochrome. Staining was carried out according to 
the manufacturer’s protocols. Briefly, 24h after transfection of the GFP-TM con- 
structs cells were trypsinized and plated on Millicell EZ glass slides (Millipore) and 
allowed to grow overnight. Cells were washed in PBS, fixed in 4% PFA at room 
temperature for 10 min and permeabilized in 70% ethanol at 4 °C for 2h. After 
washing, the probes were hybridized at 37°C for 4h, washed, incubated with 
DAPI at 37°C for 30 min, washed and mounted in Mowiol mounting media. 
Imaging was performed as for the surface immunostaining. RNA was pseudo- 
coloured red, and GFP was imaged as it appears, without any antibody. 

3’-seq. 3'-seq reads of naive B cells, B-LCL and HEK293 cells were analysed and 
visualized as described previously’. 

Northern blot analysis. Northern blots were performed as previously described’. 

CD47 probe F: 5’-TTGATGGAGCTCTAAACAAGTCC-3’; CD47 probe R: 
5'-GAATAACCAATATGGCAATGACG-3’; GFP probe F: 5’-TAAACGGCC 
ACAAGTTCAGC-3’; GEP probe R: 5'-CTTGTACAGCTCGTCCATGC-3’. 
Quantitative PCR. RNA was extracted using TRI Reagent (Ambion) according to 
the manufacturer’s protocol. CDNA was synthesized using random hexamers and 
the TaqMan Reverse Transcription Kit (Applied Biosystems). qRT-PCR was 
performed using the Power SYBR Green master mix (Applied Biosystems) on a 
7500 HT Fast Real-Time PCR System (Applied Biosystems). Each reaction was 
performed in triplicate. The experiments were performed at least three times to 
obtain at least three biological replicates. The following primers were used to 
quantify total CD47 mRNA, the long 3’ UTR isoform of CD47 and GAPDH for 
normalization. 

CD47TotalF: 5’-AGTGATGGACTCCGATTTGG-3’; CD47TotalR: 5’-GGG 
TCTCATAGGTGACAACCA-3'; CD47LongF: 5'-AAGAGAACTCCAGTGTT 
GCT-3'; CD47LongR: 5’-ACGGTAACACAGCTGTAAAACA-3’';, GAPDHF: 
5'-ACAACTTTGGTATCGTGGAAGG-3’; GAPDHR: 5'-TATTTGGCAGGTT 
TTTCTAGACG-3’. 

To measure 3’ UTR isoform expression by qRT-PCR and to take into account 
different affinities of primers, we generated a standard using plasmids that 
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contained either the short or the long 3’ UTR of CD47 (see earlier). We mixed 
together known quantities of the two plasmids ranging from 3:1 to 1:3 of short to 
long 3’ UTR and performed qPCR on these mixtures to test the accuracy of our 
primer sets. The fraction of the long 3’ UTR isoform was calculated by subtracting 
the CT value of the long isoform from the CT value for total CD47 expression. The 
fraction of the long 3’ UTR isoform corresponds to 2“ “"**"©, The fraction of the 
short 3’ UTR isoform was obtained by subtracting the fraction of the long 3’ UTR 
isoform from the total CD47 mRNA. 
Immunoprecipitation of RNA complexes and RT-PCR. GFP-TM-LU, GFP- 
TM-SU, GFP-TM-HuR-BS and GFP-TM-HuR-BSA were transfected into 
HEK293 cells and immunoprecipitation of protein-RNA complexes was carried 
out as previously described*”. RNA-immunoprecipitations were performed with 
crosslinking to prevent re-association of HuR with mRNA after lysis’*. Briefly, 
cells were harvested and washed twice in cold PBS. Formaldehyde was added to a 
final concentration of 1% (v/v) and the cells were incubated at room temperature 
for 10 min. The reaction was quenched by addition of glycine to a final concen- 
tration of 0.25 M and incubated at room temperature for 5 min. After centrifu- 
gation the cell pellet was washed twice in cold PBS and resuspended in RIPA buffer 
(25 mM Tris-HCl (pH 7.4), 150 mM NaCl, 1% NP-40, 1% Na-deoxycholate, 0.1% 
SDS, 1mM EDTA, protease inhibitor cocktail (Roche, 04693124001)). The 
mRNPs were solubilized by three rounds of sonication for 15s each in a 
Misonix Ultrasonic Processor $-4000 at an output of 8-9 W. Insoluble material 
was removed by centrifugation. Lysates were pre-cleared by addition of magnetic 
protein A beads (Millipore, LSKMAGA 10) and incubation for 30 min at 4 °C with 
constant mixing. The pre-cleared lysate was divided into three parts. One portion 
was retained for the input control and anti-HuR (Millipore, 07-1735) or IgG 
(Santa Cruz, sc-2025) were added to the two other portions and incubated 
at 4°C for 2h. Magnetic protein A beads (Millipore, LSKMAGA10) were 
added and incubated at 4°C for 1h. The beads were washed five times with 
high-stringency RIPA buffer (50mM Tris-HCl (pH 7.4), 1M NaCl, 1% NP-40, 
1% sodium deoxycholate, 0.1% SDS and 1mM EDTA). The crosslinking was 
reversed by resuspending the beads in 50mM Tris-Cl (pH 7.0), 5mM EDTA, 
10mM dithiothreitol (DTT) and 1% SDS, followed by incubation at 70°C for 
45min. RNA was extracted from the beads and buffer using TRI Reagent 
(Ambion) according to the manufacturer’s protocol, and cDNA was synthesized 
as previously described. qRT-PCR was carried out using primers for GAPDH (see 
earlier) and GFP, so as not to amplify endogenous CD47. The primers used were as 
follows: GFPF: 5’-TAAACGGCCACAAGTCAGC-3’; GFPR: 5’-AAGTCGT 
GCTGCTTCATGTG-3’. 
HEK293 cells transiently transfected with sh2 HuR or sh Co were used to assess 
the presence of SET on endogenous CD47 mRNA and the requirement of HuR 
for this association. Cells were transfected and treated with high dose puromycin 
for 3 days. FACS analysis was used to ensure greater than 90% of surviving cells 
expressed the shRNA. RNA-immunoprecipitation was carried out as described 
earlier, but using anti-SET (Abcam, ab181990) or IgG (Santa Cruz, sc-2025). The 
following primers were used for (RT-PCR of CD47-LU: CD47F: 5’-AAGAG 
AACTCCAGTGTTGCT-3’; CD47R: 5’-ACGGTAACACAGCTGTAAAACA-3’. 
The obtained CT values were first normalized to GAPDH and then the fraction 
of input mRNA was plotted. 
Western blotting. HEK293 cells stably expressing sh3 HuR, sh4 HuR, sh1 SET, 
sh2 SET, sh2 Racl, sh3 Racl or sh Co, as well as HEK293 cells transiently trans- 
fected with sh2 HuR, CD47-SU, CD47-LU, CD47-LU+N17Racl, CD47-LU2Km, 
CD47-LUAC or CD47-LUACL, were lysed in Laemmli buffer (Sigma, $3401), 
boiled for 7 min and then cooled on ice. Lysates were run on NuPAGE Novex 
4-12% Bis-Tris Gel (Invitrogen, NP0322BOX) and transferred to PVDF mem- 
brane (Bio-rad, 162-0177). After blocking for 1 h at room temperature in Odyssey 
Blocking Buffer (Li-Cor, 927-40000) the following primary antibodies were used: 
rabbit anti- HuR (Millipore, 07-1735), rabbit anti-SET (Abcam, ab181990), mouse 
anti-RAC]I (Cell Signaling Technology, 8631S), mouse anti- ACTB (Sigma, A4700), 
rabbit anti-ACTB (Sigma, A2066), chicken anti-GFP (Abcam, ab13970), mouse 
anti-CD47 (Santa Cruz, sc-59079), mouse anti-CD44 (BD Bioscience, 561858), and 
rabbit anti-TSPAN13 (GeneTex, GTX52155). The antibodies were diluted in 
Odyssey Blocking Buffer containing 0.1% Tween 20 and the blots were incubated 
overnight at 4°C. After four washes in PBST (PBS plus 0.1% Tween 20) the blots 
were incubated for 1 h at room temperature in Odyssey Blocking Buffer containing 
0.1% Tween 20 and 0.01% SDS and the following secondary antibodies: donkey 
anti-mouse IRDye 700 (Rockland Immunochemicals, 610-730-002), donkey anti- 
rabbit IRDye 680 (Li-Cor Biosciences, 926-68073), donkey anti-rabbit IRDye 800 
(Li-Cor Biosciences, 926-32213), donkey anti-mouse IRDye 800 (Li-Cor 
Biosciences, 926-32212) and rabbit anti-chicken IRDye 800 (Rockland 
Immunochemicals, 603-432-002). The blots were washed four times in PBST 
and then thoroughly soaked in PBS before imaging. Imaging was performed on 
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an Odyssey CLx imaging system (Li-Cor). Quantification of western blots was 
performed using Image J. 

Co-immunoprecipitation. CD47-SU, CD47-LU, CD47-LUAC and CD47- 
LUACL were transfected into HEK293 cells and the cells were lysed in ice cold 
RIPA buffer ((25 mM Tris-HCl (pH 7.4), 150 mM NaCl, 1% NP-40, 1% Na-deox- 
ycholate, 0.1% SDS, 1mM EDTA, protease inhibitor cocktail (Roche, 
04693124001)) for 5 min on ice. After the cells were spun down at 20,000g for 
20 min the supernatant was pre-cleared as described earlier. The lysate was divided 
in two equal parts (a small portion was removed to be used as the input control). 
Anti-GFP (Invitrogen, A-6455) or IgG (Santa Cruz, sc-2025) was added to the 
lysates anda 1 h rocking incubation at 4 °C was performed, followed by addition of 
magnetic protein A beads (Millipore, LSKMAGA10) and another 1h rocking 
incubation at 4 °C. After seven washes in RIPA buffer the beads were then boiled 
in Laemmli buffer (Sigma, $3401) for 7 min and then cooled on ice. Western 
blotting was carried out as described earlier. Chicken anti-GFP (Abcam, 
ab13970) antibody was used to confirm immunoprecipitation of GFP constructs 
and rabbit anti-SET (Abcam, ab181990) antibody was used to assess co-immu- 
noprecipitation of SET protein. 

Immunoprecipitation of RAC1-GTP. CD47-SU, CD47-LU and pcDNA 3.1 
vector alone were transfected into HEK293 cells. The levels of RAC1I-GTP were 
assessed using an Active Racl Detection Kit (Cell Signaling Technology, 8815) 
following the manufacturer’s protocol. Briefly, cells were lysed in ice cold lysis 
buffer and centrifuged at 16,000g for 15 min at 4 °C. The supernatant was added to 
GST-PAK1-PBD and glutathione resin and incubated with rocking for 1h at 4°C 
(a small portion was removed to be used as the input control). The resin was 
washed three times with lysis buffer and then SDS sample buffer was added to elute 
the bound proteins. Western blotting was carried out as described earlier. Mouse 
anti-RACI (Cell signaling Technology, 8631S) antibody was used to detect RAC1- 
GTP as well as to assess total RAC1 in the samples. 

Irradiation and cell survival assay. Jurkat, JinB8, and transfected JinB8 cells (24h 
post-transfection) were irradiated for a total of 0, 5, 10, 20 or 40Gy using a 
Shepherd Mark-1 caesium irradiator. For JinB8 or Jurkat cells transfected with 
CD47-LU, CD47-SU or with sh2 CD47-LU the percentage of GEP* cells was 
determined by FACS before irradiation. Three days after y-irradiation, cells were 
stained with TO-PRO3 (Sloan Kettering Institute, Flow Cytometry Core Facility) 
in FACS buffer A (0.5% FBS in PBS) for 10 min at 4°C and analysed using a BD 
FACSCalibur cell analyser to evaluate cell survival. Per cent survival was calculated 
as the TO-PRO3-negative cells divided by the total number of GFP” cells. Shown 
are mean and standard deviation of three biological replicates. 

Phagocytosis assay. Human macrophages were obtained by differentiation 
of THP-1 cells with 25ngml~* phorbol 12-myristate 13-acetate (PMA; Sigma) 
for 3 days. On day 3, Jurkat, JinB8, or transfected JinB8 cells were treated with 
10ugml~' Mitomycin C for 2.5h at 37°C and washed three times in media. 
Mitocycin C treatment halts cell division and allows for a more accurate assess- 
ment of the percentage of cells that are phagocytosed. For JinB8 cells transfected 


with CD47-LU or CD47-SU the percentage of GFP™ cells was determined by 
FACS before co-culture. The cells were either cultured alone or co-cultured with 
fully differentiated macrophages. After 3 days, cells were counted and the fraction 
of GFP* cells was determined by FACS analysis. The fraction of surviving cells 
after co-culture with macrophages was normalized by the number of surviving 
cells without co-culture and is show as mean and standard deviation of three 
independent experiments. 

To demonstrate that the cells were phagocytosed, during the last 10 min of 
Mitomycin C treatment the cells were also labelled with carboxyfluorescein suc- 
cinimidyl ester (CFSE; Invitrogen). Washing after Mitomycin C and CFSE treat- 
ment was carried out in cold media according to the manufacturer’s protocol. 
After co-culture, CFSE uptake by macrophages was measured by FACS analysis to 
demonstrate that a decrease in the number of surviving cells is due to phagocytosis 
by macrophages. 

Fraction of membrane proteins among HuR target genes. The list of HuR target 
genes was obtained from previous publications”’. The union of genes from both 
publications was analysed using gene ontology analysis” and all genes with the tag 
“membrane” were considered membrane proteins. This number is consistent with 
the number of membrane proteins obtained in yeast*®. Fisher’s exact test was used 
to test for significance. 

Statistical analysis. To test for significant differences between samples a two- 
sided t-test for independent samples was performed using SPSS. 
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Extended Data Figure 1 | Expression of the long CD47 3’ UTR isoform 
correlates with cell surface expression of CD47 protein. a, Fluorescence 
confocal microscopy of cells shown as in Fig. 1a. Representative images out of 
hundreds of cells are shown. Scale bars, 10 tum. b, FACS analysis of endogenous 
CD47 expression in cells shown in Fig. 1a and a. Permeabilized cells show total 
CD47 expression (purple) and non-permeabilized cells show surface CD47 
expression (blue). Representative histograms are shown (HEK293 cells, n = 10; 
U20S, Jurkat cells, n = 5; Caov-3, n = 3). Unstained cells are shown in grey. 
c, Left, quantification of mean fluorescence intensity (MFI) values from 

b. Right, fraction of surface and intracellular CD47 levels in cells lines from 
b. Intracellular CD47 was calculated by subtracting CD47 surface values from 
total CD47 values. d, Northern blot of HEK293 cells stably expressing the 
indicated shRNAs and hybridized for CD47. The shRNAs against CD47-LU 
target only the long 3’ UTR isoforms of CD47. The blot and corresponding 
RNA gel are shown as in Fig. Ic. e, Quantification of CD47 total mRNA and 


CD47 Calnexin Nucleus 


3' UTR isoform levels in U2OS cells by (RT-PCR. GAPDH-normalized values 
after transfection of sh2 CD47-LU or sh Co are shown as the mean = s.d.,n = 3 
biological replicates. The total amount of CD47 mRNA after transfection of sh 
Co was set to 1. f, FACS analysis of endogenous CD47 protein expression after 
stable expression of shRNAs against CD47-LU in HEK293 cells. Surface (top) 
and total (bottom) CD47 expression is shown. Representative histograms out of 
n = 3 experiments are shown. Unstained cells are shown in grey. 

g, Quantification of MFI values from f is displayed. Intracellular CD47 was 
calculated as in b. h, Northern blot of HEK293 cells after transfection of 
indicated constructs and hybridized against CD47. Mutation of the proximal 
polyadenylation signal in CD47-LU abrogates production of short 3’ UTR 
isoforms. Asterisk indicates cross-hybridization to ribosomal RNAs. 

i, Fluorescence confocal microscopy of endogenous CD47 and calnexin protein 
in permeabilized U2OS cells. Calnexin partially co-localizes with CD47. A 
representative image out of hundreds of cells is shown. Scale bars, 10 um. 
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Extended Data Figure 2 | UDPL depends on HuR, SET and RACI and 
mediates surface localization of membrane proteins. a, Western blot of 
HEK2393 cells transiently transfected (left, middle) or stably expressing (right) 
sh Co or shRNAs against HuR. The blot shows reduced HuR protein expression 
after HuR knockdown, but no change in protein expression of CD47, 
TSPAN13, CD44 or SET. Actin was used as loading control. b, Quantification 
of CD47 total mRNA and 3’ UTR isoform levels in HEK293 cells by RT-PCR. 
GAPDH-normalized values after transfection of sh2 HuR or sh Co are shown. 
Shown is the mean = s.d., n = 3 biological replicates. The total amount of CD47 
mRNA after transfection of sh Co was set to 1. c, FACS analysis of HEK293 cells 
stably expressing the indicated shRNAs. Histograms are shown as in Fig. 2b. 
Representative histograms from n = 3 experiments are shown. d, Western blot 


of HEK293 cells stably expressing shRNAs against SET. Actin was used as 
loading control. The marker is shown in kDa. e, As in d, but HEK293 cells stably 
expressing shRNAs against RAC1 are shown. f, 3’-seq analysis shows 3’ UTR 
isoform expression of ITGA1 in B-LCL and TSPAN13 in HEK293 cells shown 
as in Fig. 1b. FACS analysis of endogenous protein levels is shown as in Fig. 2c. 
Left panel shows ITGA1 expression in HeLa cells and right panel shows 
TSPANI13 expression in HEK293 cells. Representative histograms from n = 2 
experiments are shown. g, FACS analysis of GFP after transfection of constructs 
containing a signal peptide and GFP fused to the TMD, C terminus and either 
the long 3’ UTR (dark blue line) or the short 3’ UTR (light blue line) of ITGA1 
in HEK293 cells. Representative histograms from n = 3 experiments are shown 
as in Fig. 2d. 
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CD47 


ctgaagtgaagtgatggactccgatttggagagtagtaagacgtgaaaggaatacacttgtgtttaagcaccatggccttgatgattcactgttggggagaagaaacaagaaaagtaactggttgtcacctatg 
agacccttacgtgattgttagttaagbttttattcaaagcagcetgtaatttagtt eeeataattatgatctatgttgtttgcccaattgagatccagbettetgetgttatetttaatcaattaggggceaa 
tagtagaatggacaatttccaagaatgatgcctttcaggtcctagggcectctggcctctaggtaaccagtttaaattggttcagggtgataactacttagcactgccctggtgattacccagagatatctatga 
aaaccagtggcttccatcaaacctttgccaactcaggttcacagcagctttgggcagttatggcagtatggcattagctgagaggtgtctgccacttctgggtcaatggaataataaattaagtacaggcagga 
atttggttgggagcatcttgtatgatctccgtatgatgtgatattgatggagatagtggtcctcattcttgggggttgccattcccacattcccccttcaacaaacagtgtaacaggtccttcccagatttagg 
gtachtttattgatggatatgbebbecrtttattcacataaccccttgaaaccctgtcttgtectectgttacttgcttctgctgtacaagatgtagcaccheebeEcctctttgaacatggtctagtgacacg 
gtagcaccagttgcaggaaggagccagacttgttctcagagcactgtgttcacacttttcagcaaaaatagctatggttgtaacatatgtattcccttcctctgatttgaaggcaaaaatctacagtgtttctt 
cacttettttctgatctggggcat gaaaaaagcaagattgaaatttgaactatgagtctectgcatggcaacaaaatgtgtgtcaccatcaggccaacaggecageccttgaatggggatttattactgttgta 
tctatgttgcatgataaacattcatcaccttcctcctgtagtcctgcectcgtactccccttcccctatgattgaaaagtaaacaaaacccacatttcctatcctggttagaagaaaattaatgttctgacagtt 
gtgatcgectggagtacttttagacttttagcattcgbtetteacctgtttgtggatgtgtgtttgtatgtgcatacgtatgagataggcacatgcatcttctgtatggacaaaggtggggtacctacaggaga 
gcaaaggttaattttgtgcttttagtaaaaacatttaaatacaaagttctttattgggtggaattatatttgatgcaaatatttgatcacttaaaacttttaaaacttctaggtaatttgccacgctttttgac 
tgctcaccaataccctgtaaaaatacgtaattcttcctgtttgtgtaataagatattcatatttgtagttgcattaataatagttatttcttagtccatcagatgttcccgtgtgcectcttttatgcecaaattg 
attgtcatatttcatgttgggaccaagtagtttgcccatggcaaacctaaatttatgacctgctgaggcctctcagaaaactgagcatactagcaagacagctcttcttgaaaaaaaaaatatgtatacacaaa 
tatatacgtatatctatatatacgtatgtatatacacacatgtatattcttccttgattgtgtagctgtccaaaataataacatatatagagggagctgtattcctttatacaaatctgatggctcctgcagca 
cbttttccttctgaaaatatttacattttgctaacctagtttgttactttaaaaatcagttttgatgaaaggagggaaaagcagatggact tgaaaaagatccaagctcctattagaaaaggtatgaaaatctt 
tatagtaaaabtttttataaactaaagttgtaccttttaatatgtagtaaactctcatttatttggggttcgetcttggatctcatccatccattgtgttctctttaatgetgectgecttttgaggceattcac 
tgccctagacaatgccaccagagatagtgggggaaatgccagatgaaaccaactcttgctctcactagttgtcagcttctctggataagtgaccacagaagcaggagtcctcctgcttgggcatcattgggcca 
gttccttctctttaaatcagatttgtaatggctcccaaattccatcacatcacatttaaattgcagacagtgttttgcacatcatgtatctgttttgtcccataatatgcetttttactccctgatcccagtttc 
tgctgttgactcttccattcaghtttatteattgtgtgttctcacagtgacaccabttgEectttectgcaacaacctttccagctacttttgecaaattctatttgtcttctccttcaaaacattctectttg 
cagttcctcttcatctgtgtagetgetcttttgtctcttaacttaccattcctatagtactttatgcatctctgcttagttctattagbbteteggecttgcetcttctecttgattttaaaattccttctatag 
ctagagcbtepetteetttcattctctcttcctgcagtgttttgcatacatcagaagctaggtacataagttaaatgattgagagttggctgtatttagatttatcactttttaatagggtgagcettgagagee 
GRSEEE ct tt cto BEREEEEt BEREGEE BECEEEE BREEEEE BESEteet BetttgactaatttcacatgcotctaaaaaccttcaaaggtgatBabeEeectcctggaaactccaggtccattctgtttaa 
atccctaagaatgtcagaattaaaataacagggctatcccgtaat tggaaatabbeettttttcaggatgctatagtcaatttagtaagtgaccaccaaattgttatttgcactaacaaagctcaaaacacgat 
aagtttactcctccatctcagtaataaaaattaagctgtaatcaaccttctaggbttetettgtcttaaaatgggtattcaaaaatggggatctgtggtgtatgtatggaaacacatactccttaatttacctg 
ttgttggaaactggagaaatgattgtcgggcaaccgtttabbttttattgbattttatttggt tgagggabbttettat aaacagttttacttgtgt cabatbtttaaaattactaactgcecatcacctgcetggg 
gtectttgttaggtcattttcagtgactaatagggataatccaggtaactttgaagagatgagcagtgagtgaccaggcagbeeeeetgcectttagcetttgacagttcttaattaagatcattgaagaccaget 
ttctcataaatttcHetttttgaaaaaaagaaagcatttgtactaagetcctctgtaagacaacatcttaaatcttaaaagtgttgttatcatgactggtgagagaagaaaacabtbtegettetattaaatgga 
gcattatttacaaaaagecattgttgagaattagatcccacatcgtataaatatctattaaccattctaaataaagagaactccagtgttgctatgtgcaagatcctctcttggagcEttttigcatagcaatt 
aaaggtgtgctatttgtcagtagccabbbetbtgcagtgatttgaagaccaaagbegebbeacagctgtgttaccgttaaaggbeeeteeetteatatgtattaaatcaatttatcactgtttaaagetttgaa 
tatctgcaatctttgccaaggtachbetbeabtbaaaaaaaaacataactttgtaaatattaccctgtaatattatatatacttaataaaacattttaagcbaeeeegetgggctatttctattgctgctacag 
cagaccacaagcacatttctgaaaaatttaatttattaatgbabeebeaagttgcttatattctaggtaacaatgtaaagaatgatttaaaatattaattatgaabeeteegagtataatacccaataagcettt 
taattagagcagagttttaBmgagttttaaatcagtec 


CD44 


cacctacaccattatcttggaaagaaacaaccgttggaaacataaccattacagggagctgggacact taacagatgcaatgtgctactgabegtttcat tgcgaatebtttttacEeeee Et ttctactet 
PRPEGEEREELGEGEEE EGE ECE tt aaagtcaggtccaatttgtaaaaacagceattgcetttctgaaattagggcccaattaataatcagcaagaatttgatcgttccagttcccacttggaggectttcatece 
tegggtgtgctatggatggcttctaacaaaaactacacatatgtattcctgatcgccaacctttcccccaccagctaaggacatttcccagggttaatagggcctggtccctgggaggaaatttgaatgggtcc 
attttgcecttccatagectaatccctgggceattgetttccactgaggttgggggttggggtgtactagttacacatcttcaacagaccccctctagaaabbetecagatgcettctgggagacacccaaagggt 
gaagctatttatctgtagtaaactatttatctgtgbeebegaaatattaaaccctggatcagtcctttgatcagtataabbbbebaaagttactttgtcagaggcacaaaagggtttaaactgattcat xia 
Htatctgtacttcttcgatcttcaccttttgtgctgtgattcttcagtttctaaaccagcactgtctgggtccctacaatgtatcaggaagagctgagaatggtaaggagactcttctaagtcttcatctcaga 
gaccctgagttcccactcagacccactcagccaaatctcatggaagaccaaggagggcagcacegEEELEGEELEEEgGEEEtEEgEeEtte EE Ettebtegacactgtccaaaggttttccatcctgtcctggaatec 
agagttggaagctgaggagcttcagectcttttatggtttaatggcecacctgttctctcctgtgaaaggctttgcaaagtcacattaagtttgcatgacctgttatccctggggccctatttcatagaggctgg 
ecctattagtgatttccaaaaacaatatggaagtgecttttgatgtcttacaataagagaagaagccaatggaaat gaaagagattggcaaaggggaaggatgatgcecatgtagatectgtttgacabbtttat 
ggctgtatttgtaaacttaaacacaccagtgtctgttcttgatgcagttgctatttaggatgagttaagtgcctggggagtccctcaaaaggttaaagggattcccatcattggaatcttatcaccagataggc 
aagtttatgaccaaacaagagagtactggctttatcctctaacctcatattttctcccacttggcaagtcctttgtggcatttattcatcagtcagggtgtccgattggtcctagaacttccaaaggctgcttg 
tcatagaagccattgcatctataaagcaacggctcctgttaaatggtatctcctttctgaggctcctactaaaagtcatttgttacctaaacttatgtgcttaacaggcaatgcttctcagaccacaaagcaga 
aagaagaagaaaagctcctgactaaatcagggctgggcttagacagagttgatctgtagaatatctttaaaggagagatgtcaactttctgcactattcccagectctgcetcctccectgtctacectctceccct 
ecetctctecctccacttcaccccacaatcttgaaaaacttectttctcttctgtgaacatcattggecagatccattttcagtggtctggabbeetetteabbtecttttcaacttgaaagaaactggacatt 
aggccactatgtgttgttactgccactagtgttcaagtgect chtgttbtcccagagatttcctgggtctgccagaggeccagacaggcetcactcaagctctttaactgaaaagcaacaagecactccaggaca 
aggttcaaaatggttacaacagectctacctgtcgececagggagaaaggggtagtgatacaagtctcatagecagagatggttttccactecttctagatattcccaaaaagaggctgagacaggaggttabe 
Pbcaabtttattttggaattaaatachbbtttccctttattactgttgtagtecctcacttggatatacctctgttttcacgatagaaataagggaggtctagagettctattccttggecattgtcaacggag 
agctggccaagtcttcacaaacccttgcaacattgcctgaagtttatggaataagatgtattcotcactcccttgatctcaagggcgtaactctggaagcacagcettgactacacgtcabetbeaccaatgattt 
tcaggtgacctgggctaagtcatttaaactgggtctttataaaagtaaaaggccaacabbeaabeabeeegcaaagcaacctaagagctaaagatgtaafeeebertgcaattgtaaatchttegegectcctg 
aagacttcccttaaaattagctctgagt gaaaaat caaaagagacaaaagacatcttcgaatccatatttcaagectggtagaattggettttctagcagaacctttccaaaagbtttatattgagattcataa 
caacaccaagaattgattttgtagccaacattcattcaatactgttatatcagaggagtaggagagaggaaacatttgacttatctggaaaagcaaaatgtacttaagaataagaataacatggtccattcacc 
tttatgttatagatatgtctttgtgtaaatcabbtgeetegagttttcaaagaatageccattgttcattcttgtgcetgtacaatgaccactgttattgttactttgacttttcagagcacacecttcctctgg 
Pttttgtatabttattgatggatcaataataatgaggaaagcatgatatgtatattgctgagttgaaageacttattggaaaatattaaaaggctaac Hea gactaaaggaaacaga 


HuR-BS 
tttaatagggtgagcttgagagbtetetteotttoegt eee tt eet tt Ett ttgactaatttcacatgctctaa 


HuR-BSA 


tttaatagggtgagcttgagag 


gactaatttcacatgctctaa 


Extended Data Figure 3| 3’ UTR isoforms that encode proteins using CD47, CD44, HuR-BS and HuR-BSA. Red, ApA signals. Blue, uridine-rich 
UDPL contain uridine-rich elements. Shown are the 3’ UTR sequences of elements with the potential to be HuR-binding sites. 
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ITGA1 


aatattttat gaBeeeeeelnel caattattcaataatctatcctcaggtttgectcaaatatgtgacaagaaatgtataattcatgacatagtcatgtaactatgtaatccatcagggattcattacttgg 
aaaatgacaggtcatgcattatccaaaaacaataccaaaaagacatattttataaaatgacaaaaaatattttacataattactcabttttgttgagtaagcaaaat tacaaagegttttbaaaaaaacctgta 
caaatatgtttatgtattaaatcaccatccaaaatatttaaggaatatataaaaagabbeeeatgatcatagaaacatctattttcaaaacaatataaattaagettttecccttgattectgttggatatcca 
tgttcagcatgacagtcagcactcgtaaatgccaagaaaagaattacctgaaaaagatcatttctccctattcaaatgagaatattttcccttggtagaatccatatacaatatggattgaaaattaagtcacc 
aagaatacatttagcatctttgtgttgggttctatacctaaaaattttccctccctgcccaaggtgattggaaagtactgcaattgagctaaaattttatctaggatggaaaatcaggatagactcgtttaaaa 
acaagtatattaaatgggttaaattaagcettttacagaacatgcttaattttgatattgagctctaaatgcaaaactcttttatactgaaaaaacacttaaagtagttttgtgtggcatcacagtgatttgtca 
tgaaaagecatatatgggggacatgctaaaatggettctgaatgaaatacgacagcagagaaagatgccactgaaatgctgaaattatctttgctgagcagettttgaatgctaagggttccagtatgtgacec 
aaagaaagagttgtctcaactecttggtacagggttcattcaaacccccaagctgtgagagtgtgbtbatttttaattttttaaaaggtatttaatttccaaccacacktebtatttttttgaaaagacaattt 
tecatttcagtattacattttaccaagaagtcactgcbbbebaggtgtgttgcatcaatatttaaataaaatcccagaaatgtattttaaagaatttcaggtttgaaaactcaaaccaacagiaaimatcaca 
cotttgtatgtgcttcBbegeEettgaagtagtcattatggttcattggaatctactgtagcataagetcctaggttgctccatttcttaggaaaatagcaatgaatgaatgtatagtagetgceatgtgtgaac 
tactgctgtaaagtttgctgacatttctgcaacaacatttctccatctgatttaccagaacctgtaacaagaccagaacatgtattcatccagaaactcctgggcetctgcetttgttcaaagaggagcaaatcca 
catagaaattaaatgttataaabbeateeeteeaaactaatttccaaagaatttagctgaagactcagagtcagectacctcttgactctccacatecgcatctaagatectgaaggecaaaggaatgegectg 
tgggatgccaaagccatcagtagctggaagaaagaatacctctttctagaggtttggtgtectcttgetggectgtcattatttggaaatacgcgettttggaatttcecttcattttcgcetatttccaggttca 
gaaatcatacctaccact tqbbeLebetbegcccagtaaaactcagggtttctcttattcagagttaccatgttgcacacceccagtgct cagectgeccagaageecttctctgatctcagcaagatcttagt 
agtaaagctaacggatccattgaaattcatactcagcatctgtgactactagacaaagceccatttactgaaccataagtgtgcacttttaaactaaattgttaactgaaagggaaaaaatagaaatgagggcat 
tttgaagBbebbagaaaagagaggaagagcetgttggecagatcaaagagatacttataageccggcacaabbbtbaacataagtcattcctggctattaagtaaaatatttcatgaaccagaagtcctgtttct 
ctaatattataatttgttgaggtttgtgtgtgtgtgtgtgtgcatgtgcagacacatgtacacatataaattgaatacaaaaatagctctataacaccaaaaaacagagtttacagaaaataaacacaaaagct 
atatctgcaagacaagtttgattcatctagtttcatatagtttcatatagbebeatebeeattaatatttaggtttaacttatagttctgcaattatttaaaaaacaagtacagagcegtttgattaaaaaaagt 
caacaaatgtttcaaaacagctttgtacaaatgttataaaacatatttaatgattaagaaatgtactagactctatcabbbacteebaaaacatttaattagaaaaacaggtgatatttaagactatttataaa 
cttattggccaaattgccaaaaagbbbebtebectctaactttggtattctaaataattgattattgctcccattgtaaactgaaccatgtatgceactctcttttctgaccaaactacaatggtatattggtaa 
attatttaacaacacactgtccaggagggaaaaaaaaaacagtctctgatttgtatcatttgcecaatttccatggtccaaatattctctctccattttcaaactaccaaagtgaccctaatcagcttacaaaat 
tectggaaatgtatctgccacctcttaccaaccagtgtgagctgacttcagcacaccactgagacagattacacgatagacagatagatagatagatagatagatagatagatagatagatagatagatattga 
tagagaacatgttgtgtctatatacatatccaatgacacacagtataatcatgttttaaattatgccaaaagtactgattatatcaagcccagtaaacataatgctacacagcacccattchttaattetgate, 
fictaaaattataaccttcctttgatattaataaaatgttatgaacttgcaggagaaggaaggagt tgaaagaaggaaaatataaagggtgatccacagtgaaaatcagacctacacaagecacaaaaaaattgt 
ctcattttcccaaggaattcaaagtcctgtgatacacattacaaaagatgacactttgtaaatattttgaacttctatgatccaatttcaacttcattggtccaatttgtcaaactattttectctgagcacta 
gattcatgggcaatttaatctotgaatattttctgtgtcataagtgtgcttgaaaagaggcettgtagtaaacagatabebebacaaaatttgatgcttctgcaaccataataattttatctataaatgegtata 
tacatatatatatatatatatatacacatacactcacatgcagatatttacatatatagatattctcttttccttttataaaatcaaagtctaggctcaattggaaaatggaaatatttgcaccaaaagaaaaa 
agtgccattgttgtgatcaatggagtaaagctggggtatagtagaggctggggaatccctgceccatgttctctccaaggecacagettgcacaccgtgaatggtggecccactggcetccatggagtgtgecttc 
acgcgggggaatgtccttgtccaacacgetgtaatctactccatgggaactgactgtcctgcettttcacaacat tgagaaacaggceccagagectgtctcatgtgcecaggtgcagtcacattattgttaaaata 
aagtggcttctgacaaat cagbbeeucaaaatagcatcctatgtacttttcatttcttccaacaacectcaggececettttgecagttttctcceccaggggagattttcaagactactggagaaaaatctgt 
+ QBBBlacatttgctctgagaaaaaaaaaaaaaaaa 


TNFRSF13C 


cagggagccggcaggaggtggcccctgcecctccctctggacccccagccaggggcttggaaatcaaattcagctcttcactccagcatgcacatgccctctttctgggacc 
aggctaactctgcagaagcacagacactacagaccacagcattcagcccccatggagtttggtgtgcttgcectttggcttcagacctcaccatctttgacagcccttgaag 
gtggtagcccagctcctgttcctgtgccttcaaaaggctggggcactatg agaccgcttttaaaatggggaaggcaccattaagccaaaatgaatctgaaaaaag 
accaagtgggaggatgcctaatttgtagttgggtggbtteetgtg ga gagacgagtcttgctctgtcgccaggce 
tggagtacagtggcgcgatctcgcctcactgtaacctccgactccttggttcaagtgattctcctgcctcagcectcccgaatagctgggattacaggcacgcaccaccaca 
gecagctaabbettgeatttetagtagagacgggatttcaccatgttggccaggatggtctcgatctcctgacctcgtgatecgcectgctcggectcccaaagtgctggg 
attacaggcatgagccaccatgeccggectt gBteEgegeEELggEettttettgagacagggtcttgcetgtatcacgcaggctggagtgcaacgatgggtccccagcetaat 
tgcagectgggctcaagceaatctttctgettcagectectgagtaggtgggactacaggtgctccaccttgcccagctaaBbeEEELEEEELE EEE ttt eet gagacagag 
tctcgctctgtcgcectaggctggagtgcagtgacgcaatttcggctcacttcaagctccgcectcctgggttcacaccattctcctgcctgagcctcctgagtagctgggac 
tataggcgcccgccatcacgcccggctaat agtagagacggggtgtcaccgtattagccaggatggtctcgatctcctgacctcgtgatccgcccg 
ectcagectcccaaagtgctaggattacaggcegt gagecaccgtggctggccBEbEEEEEELeEEaaagacagggtctcggtatgttgcccaggecagtctcaaactecctg 
gcctcaggcagtcctccccgaagggctgggattataggcaggaggcgcecgggcectggectctatttctttaatacctgccctcagagccgtcctgcttttaaactgatgca 
aatggcagcagtgaataatcaattcatgggttcgcettgcetttgggatttagtacaagaBeaLee toe aa caaaggagaaaagattgagctggattttgetgattct 
caggtatctgtaccccagggtcaaccctgagattggcccaatctgccagaacctaagatgccattttgaccccagtggcagcttccagagccacaagcctgcatttgggag 
gaggtgcgcagcctccagggctggactccaaactcgagctgaaggaacatgctggcaggtagctctctcagggtccacctggggttccagctgctccaagacacagcacat 
ttagccataaacatggaagggctgggacctacaggaggctcctttgtgtagaggccagggcaggcgtgtgaccttgaaccaggtggacatgaggtcccttgaactcccttc 
tcetctctggcttgttcccctgaagcactggccccccagtgctgggcttatttggccatcctgtgccagecccctggtcacccaccgtcacagtggaagctgctccctcctc 
ttacaatggtgttccttgctctggggtctggggacaagaccatctctttccactcctggacgacactgtgcactcagtcagecgcectcctacctgccctcctgcacagggt 
cagaacctcctgacctggacttcgccctgggcagcttccttggaacttgttgcagcttagcatctttggaagggagcactggactaggagtcagaaggtggagttctgggt 
gctgttttcccactcttgggccctgggcatgttctgttctgtgcatcagatgttttctgcctgcctccccctgagctgccaccagagccacctgtagacccctgcttctcc 
tgcccctcctgtcccttgccaggectctggactcagtgcttcctgtcccaaggacctgttgtgtgcccagcttggaagacagtggccactcacatccaggtctctgcctgg 
ctatttgctccatgaacccatctcagatgcccctgcccccagggagccaggtgacagggtgaggtcaagtgacaagcagtaggtgtgggagaggtttggtggggtcaggcc 
cacatgggcccagccatccactccccttccatctggagttgggggagtctgagctgctcctggccaccccatctgccccaccccatcttttcctggccaccccagctccca 
ecccatcttctcctggccaccccatctcccaccccatcttctcttgacaccccatctcccaccccatcttctcctggccaccccatctcccaccccatcttctcctggcca 
ecccatcttctcctggccaccccatctcccccaccccatcttctcctggacacccccatctcccaccccatcttttgctacagccagggttagctcagcaggtgaaaaccc 
cgagggtgggtgaaacccctctggggctcagacatgcaaaccttgggcatctctctgtcccagctggccccgccagcecggtaggaagtttcccctgagttctcag| 
GtEctgaaaaatgaggggttgtatgcaaggttctectectggectgtggtccccagagaagggcaggaaggaaccttagataattctcatatgcatttaacagacgaggaa 
actgagacccagagccgtcacatcaatacctcatttgatcttcataagagcacctggaggaggggggtggggtgBbtgegettgettaaabtttettttgegaaaaaaatg 
aagataggcattttgtagacaatctggaagttctggaccggaatccatgatgtagtcagggaagaaatgacccgtgtccagtaaccccaggectcgagtgtgtggtgtabE 

acataattgtaatcattctatacatacaaattcatgtcttgaccatcatattaatatttggtaagtttctctctctttagagactccac Emmet tttcaacat 
ggtaaggttttccacctgggcatttggatactttggbtteegetgttgtggecaa 


TSPAN13 


gaaaacaaggaagabeecettecgtattatgatcttgttcacbeberguaabeetetgeeaagctccatttgccagtttaaggaaggaaacactatctggaaaagtaccttattgatagtggaattatatabee 
BRactctatgtttctctacat gBbbeeebetbeccgt tgctgaaaaatatttgaaacttgtggtctctgaagetcggtggcacctggaatttactgtattcattgtcgggcactgtccactgtggectttctta 
gcabttttacctgcagaaaaactttgtatggtaccactgtgttggttatatggtgaatctgaacgtacatctcactggtataattatatgtagcactgtgctgtgtagatagttcctactggaaaaagagtgga 
aatttattaaaatcagaaagtatgagatcctgttatgttaagggaaatccaaattcccaaBeEEEEELggbeEetEbaggaaagattgttgtggtaaaaagtgttagtataaaaatgataatttacttgtagtc 
ttttatgattacaccaatgtattctagaaatagttatgtcttaggaaattgtggtttaabeeeegacttttacaggtaagtgcaaaggagaagtggtttcatgaaatgttctaatgtataataacatttacctt 
cagectccatcagaatggaacgagttttgagtaatcaggaagtatatctatatgatcttgatattgttttataataatttgaagtctaaaagactgcabeeeeaaacaagttagtattaatgegttggeccacg 
tagcaaaaagatatttgattatcttaaaaattgttaaataccgttttcatgaaatttctcagtattgtaacagcaacttgtcaaacctaagcatatttgaatatgatctcccataatttgaaattgaaatcgta 
ttgtgtggctctgtatattctgttaaaaaattaaaggacagaaacctttctttgtgtatgcatgttt cPanel otaatggaagaattgatcgatg 


Extended Data Figure 4| 3’ UTR isoforms that encode proteins using ITGA1, TNFRSF13C and TSPAN13. Red, ApA signals. Blue, uridine-rich 
UDPL contain uridine-rich elements. Shown are the 3’ UTR sequences of elements with the potential to be HuR-binding sites. 
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Extended Data Figure 5 | Local recruitment of SET to the site of translation 
is required for UDPL. a, Western blot of cells used in Fig. 3b shows the amount 
of overexpression achieved by transfection of MS2-mC-SET or MS2-mC-HuR 
(for constructs, see b). Left, anti-SET detects endogenous expression of SET as 
well as overexpressed SET. Right, anti-HuR detects endogenous HuR and 
overexpressed HuR. Actin was used as loading control. Anti-HuR and anti-SET 
were used on the same blot. Actin as loading control was performed once. The 
marker is shown in kDa. Asterisk indicates unspecific band. mC, mCherry. 

b, The top construct depicts GFP-TM-SU (Fig. le) and the bottom construct 
shows a fusion of MS2 coat protein (MS2), mC (red) and HuR or SET, 
respectively. Overexpression of HuR or SET compared with expression of MS2- 
mC alone does not change surface or total GFP expression, when co-transfected 
with GFP-TM-SU (without the addition of MS2-binding sites to the SU 
isoform) as shown by FACS analysis. Surface expression (top) and total 
expression (bottom) in HEK293 cells are shown. Values for MFI are shown in 
parentheses. Unstained cells are shown in grey. Representative histograms from 
n = 2 experiments are shown. c, FACS analysis of cells used in Fig. 3b. MS2- 
binding sites (MS2-BS, RNA stem loops) were added to GEP-TM-SU (and the 
proximal polyadenylation signal was mutated) to obtain GFP-TM-SU-MS2- 
BS. Transfection of MS2-mC-HuR (left, dark purple line) or MS2-mC-SET 
(right, dark purple line) increases surface GFP expression compared with 
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transfection of MS2-mC (light purple line), when GFP-TM-SU-MS2-BS is 
co-transfected. Thus, tethering of HuR or SET to the short 3’ UTR of GFP- 
TM localizes GFP to the cell surface without changing total GFP expression. 
Histograms are shown as in b. Representative histograms from n = 5 
experiments are shown. d, As inc, but tethering was impaired by omission of 
the MS2 coat protein. Histograms are shown as in b. Representative 
histograms from n = 2 experiments are shown. Summary of the tethering 
experiment: To tether SET or HuR to the 3’ UTR (which brings it close to the 
site of translation through the scaffold function of the 3’ UTR), we added 
MS2-binding sites to GFP-TM-SU c, MS2-binding sites are derived from the 
bacteriophage MS2 and form RNA stem loops. The capsid protein of MS2 
(here, called MS2) specifically recognizes these MS2 stem loops. Constructs 
were generated containing MS2 fused to mC and then either HuR, SET or 
with no further coding sequence (Fig. 3b). Co-expression of these constructs 
with the construct containing the short UTR and MS2-binding sites results 
in recruitment of SET or HuR to the short 3’ UTR of GFP-TM. The cells that 
express MS2 fused to only mC localize GFP to the endoplasmic reticulum, 
but constructs containing MS2 fusions to HuR or SET localize GFP 
primarily to the cell surface (Fig. 3b and Extended Data Fig. 5c). Omitting 
either MS2 or the MS2-binding sites from the experiment abrogates surface 
localization (Extended Data Fig. 5b, d). 
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Extended Data Figure 6 | CD47 contains at least two SET-binding sites in its 
cytoplasmic domains. a, FACS analysis of surface GFP expression after 
transfection of GFP-TM-LU (dark blue line) and GFP-TM-LU constructs 
containing a single point mutation in the cytoplasmic C terminus (light blue 
line), K290A (left), K297A (middle), K304A (right) in HEK293 cells. Partial 
destruction of a single SET-binding site results in up to 37% reduction in GFP 
surface expression. Values for MFI are shown in parentheses. Unstained cells 
are shown in grey. Representative histograms from n = 5 experiments are 
shown. b, FACS analysis of GFP expression after transfection of GFP-TM-LU 
(dark blue line), GFP-TM-LU containing a mutation of the SET-binding site in 
the C terminus (K290A, K304A; 2Km; light blue line; left), containing a deletion 
of the C terminus (AC; light blue line; middle panel), or destruction of both 
SET-binding sites (AC combined with K163A, K166A, K175A; ACL; light blue 
line; right). Surface (top) and total (bottom) expression is shown in HEK293 
cells. Values for MFI are shown in parentheses. Unstained cells are shown in 
grey. Representative histograms from several experiments are shown (2Km, 
n= 3; AC, n= 10; ACL, n= 4). 
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Extended Data Figure 7 | CD47 protein has different functions depending 
on whether it was generated by the SU or LU isoform. a, Left, western blot of 
HEK293 cells after transfection of the indicated constructs shows GFP-CD47 
expression using an anti-GFP antibody. Actin was used as loading control. 
Right, as in left panel after transfection of CD47-SU and CD47-LU into 
HEK293 (left) or JinB8 cells (right). GFP-CD47 expression was quantified after 
normalization with respect to actin using Image J. Shown is the fold change in 
GFP-CD47 expression of CD47-SU after setting CD47-LU to 1. b, The 
experiment is similar to Fig. 3b and Extended Data Fig. 5c, but here the 
constructs containing the full open reading frame of CD47 were used. FACS 
analysis of GFP expression after transfection of CD47-SU-MS2-BS. Co- 
transfection of MS2-mC-HuR (left, dark purple line) or MS2-mC-SET (right, 
dark purple line) increases surface GFP expression compared to co-transfection 
of MS2-mC (light purple line). Surface expression is shown in non- 
permeabilized HEK293 cells. Values for MFI are shown in parentheses. 
Representative histograms from n = 3 experiments are shown. Unstained cells 
are shown in grey. c, Left, FACS analysis of GFP after transfection of constructs 
containing a signal peptide and GFP fused to the open reading frame of BAFFR 
and either the long 3’ UTR (BAFFR-LU, dark blue line) or the short 3’ UTR 
(BAFFR-SU, light blue line) in HEK293 cells. Surface (top) and total (bottom) 
GFP expression is shown. Values for MFI are shown in parentheses. 
Representative histograms from n = 3 experiments are shown. Unstained cells 
are shown in grey. Right, as in left panel but for CD44. d, Table showing the fold 


increase in surface GFP expression mediated by the LU isoform compared with 
the SU isoform. Top row shows values of constructs without the ECD and 
bottom row shows values of constructs containing the full coding regions of the 
indicated proteins. The fold increase in surface GFP expression was calculated 
from MFI (LU)/MFI (SU). The contribution of the ECD domain for surface 
expression of BAFFR is 1.2-fold (3.8/3.1). e, FACS analysis of 
carboxyfluorescein succinimidyl ester (CFSE) uptake in macrophages. 
Macrophages were co-cultured without (grey) cells or with cells that were pre- 
treated with CFSE and expressed high or low amounts of surface CD47 (data 
not shown). The experiment shows that the macrophages phagocytose the cells 
depending on their CD47 surface expression levels. A representative histogram 
from n = 2 experiments is shown. f, The fraction of surviving cells (TO-PRO3 
negative) as measured by FACS analysis at day 3 (d3) after increasing doses of 
y-irradiation is shown for Jurkat, JinB8 (CD47 ’~) and the GFP* fraction after 
nucleofection of JinB8 cells with either CD47-SU or CD47-LU. The values were 
obtained from the same experiment as shown in Fig. 4e, but here the values 
were calculated using all GFP-positive cells. Shown are the values for 

mean ~ s.d., n = 3 biological replicates. Gy, Gray. g, The fraction of surviving 
cells (TO-PRO3 negative) as measured by FACS analysis at day 3 (d3) after 
increasing doses of y-irradiation is shown for Jurkat, JinB8 (CD47 ~~) and the 
GFP* fraction after nucleofection of Jurkat cells with sh2 CD47-LU. Shown are 
the 20% of cells with the highest GFP expression (green). Shown are the values 
for mean + s.d., n = 3 biological replicates. Gy, Gray. 
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Extended Data Figure 8 | HuR, SET and RACI are widely and highly targets’ consist of the union of HuR targets identified previously””. Membrane 


expressed. a, The mRNAs of proteins necessary for UDPLare ubiquitouslyand _ proteins consist of all the proteins that contain the tag “membrane” using gene 
highly expressed across cell lines (left) and tissues (right). Shown are values for _ ontology analysis. The fraction of membrane proteins found is consistent with 
transcripts per million (TPM). The median abundance levels of all expressed _ the fraction of membrane proteins found in yeast”. Fisher’s exact test shows no 
genes in the data sets are shown as dashed lines. ELAVL1 encodes HuR. The __ enrichment or depletion of membrane proteins among the HuR targets. 

data set from ref. 3 was analysed to obtain the TPM values. b, Here, ‘HuR 
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Extended Data Figure 9 | All tested UDPL candidates have potential SET- _ studied. The TMDs are shown in green and the positively charged amino acids 
binding sites in their cytoplasmic domains. Shown are the amino acid in the cytoplasmic domains, indicating potential SET-binding sites, are shown 
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Synthesis and applications of RNAs with position- 
selective labelling and mosaic composition 


Yu Liu’, Erik Holmstrom”, Jinwei Zhang*, Ping Yu‘, Jinbu Wang', Marzena A. Dyba*, De Chen®, Jinfa Ying°, Stephen Lockett’, 
David J. Nesbitt”, Adrian R. Ferré-D’Amaré’, Rui Sousa’, Jason R. Stagno! & Yun-Xing Wang! 


Knowledge of the structure and dynamics of RNA molecules is crit- 
ical to understanding their many biological functions. Furthermore, 
synthetic RNAs have applications as therapeutics and molecular 
sensors. Both research and technological applications of RNA would 
be dramatically enhanced by methods that enable incorporation of 
modified or labelled nucleotides into specifically designated posi- 
tions or regions of RNA. However, the synthesis of tens of milli- 
grams of such RNAs using existing methods has been impossible. 
Here we develop a hybrid solid-liquid phase transcription method 
and automated robotic platform for the synthesis of RNAs with 
position-selective labelling. We demonstrate its use by successfully 
preparing various isotope- or fluorescently labelled versions of the 
71-nucleotide aptamer domain of an adenine riboswitch' for nuclear 
magnetic resonance spectroscopy or single-molecule Forster res- 
onance energy transfer, respectively. Those RNAs include molecules 
that were selectively isotope-labelled in specific loops, linkers, a helix, 
several discrete positions, or a single internal position, as well as 
RNA molecules that were fluorescently labelled in and near kissing 
loops. These selectively labelled RNAs have the same fold as those 
transcribed using conventional methods, but they greatly simplify 
the interpretation of NMR spectra. The single-position isotope- and 
fluorescently labelled RNA samples reveal multiple conformational 
states of the adenine riboswitch. Lastly, we describe a robotic plat- 
form and the operation that automates this technology. Our selective 
labelling method may be useful for studying RNA structure and 
dynamics and for making RNA sensors for a variety of applications 
including cell-biological studies, substance detection’, and disease 
diagnostics**. 

The ability to synthesize RNAs with selectively labelled regions or 
positions is important for the following reasons. First, NMR is a 
powerful technique for probing RNA structure and dynamics, but is 
limited by severe signal overlap in the spectra of RNA. This could be 
overcome by region- or position-specific isotopic labelling of RNA so 
that spectral signals from critical residues could be observed without 
interference from the rest of the molecule. Second, Forster resonance 
energy transfer (FRET) experiments measure distances between fluor- 
ophores introduced site-specifically into macromolecules and have 
been used to study RNA structure and dynamics. Optimal placement 
of FRET pair fluorophores usually requires incorporation at RNA 
chain-internal positions and is difficult with current methods of 
synthesis, particularly with large RNAs. Third, position-specific 
incorporation of nucleotides derivatized with heavy or anomalously 
scattering atoms into RNA may aid crystallographic phase determina- 
tion. Fourth, RNAs synthesized with modified nucleotides to enhance 
functionality or RNase resistance can exhibit enhanced in vivo effi- 
cacy’. Lastly, RNA aptamers, labelled with fluorophores at detection- 
sensitive positions, could have broad applications as sensors because of 


their high affinity and specificity for a variety of substrates, including 
molecular cancer markers’. 

Solid-phase chemical synthesis is limited to short RNAs (=60 nucleo- 
tides (nt))®” and isotope-labelled versions of these chemicals are not 
commercially available. Solution-phase transcription with bacteriophage 
T7 or SP6 RNA polymerases (RNAP3) allows synthesis of longer RNAs’, 
but is limited to either uniformly labelled samples or those labelled by 
base type’. Incorporation of modified nucleotides by T7 RNAP is limited 
by poor processivity during initiation. However, once the RNA has been 
extended to at least 10 nt, elongation is highly processive'”’’. Elongation 
can be stalled by omitting the nucleoside 5’-triphosphate (NTP) required 
for transcription beyond a specified template position, and can be 
restarted by addition of the missing NTP(s)'*’*"*. This pause-restart 
method enables synthesis of RNAs with region- or position-specific 
modifications by using solid-phase-coupled DNA templates and limiting 
combinations of NTPs, so that elongation can be stopped at specific 
positions and restarted with a new NTP mix (Fig. 1). A similar approach 
has been attempted for in vitro peptide synthesis’*. The pause-restart 
method has been used for mechanistic studies, although the synthesis of 
RNA using this method on a preparative scale has not been realized. 

To achieve this, we first coupled 5’-biotinylated DNA templates to 
streptavidin-agarose beads (Fig. 1a)'° and then incubated them with 
T7 RNAP andan NTP mix lacking cytidine 5’-triphosphate (CTP)’*”. 
This causes elongation to stall at +14, where the first CTP would 
otherwise be incorporated. After extensive washing with buffer, 
elongation was resumed by addition of a new NTP mix containing 
CTP. RNA synthesis can be repeatedly paused and resumed in this way 
to incorporate modified or labelled NTPs at desired locations (Fig. 1b). 
Once transcription extends past the position where specific labelling is 
desired, the reaction is terminated and re-initiation prevented by addi- 
tion of an NTP mix without GTP, addition of heparin, which prevents 
reinitiation’®, and a complete NTP mix, or quenching at 4 °C. The 
entire process can be repeated several times with the same bead-bound 
template to increase RNA yield. The probability of incorporating a 
wrong nucleotide occurs 1 in 20,000 (ref. 16). Development of the 
method—dubbed PLOR for position-selective labelling of RNA—is 
described in detail in Extended Data Figs 1 and 2. As the method 
requires multiple washing and NTP addition steps, we have developed 
a fully automated platform for performing PLOR (Fig. 1c; see Methods 
and Extended Data Fig. 2 for details of design and operations of the 
platform). We illustrate applications of this method using the 71-nt 
aptamer domain of an adenine riboswitch, riboA71 (Fig. 2a), which 
changes conformation upon binding adenine’. The sequence of 
riboA71 was from Vibrio vulnificus with the first stem sequence modi- 
fied to enhance the transcription yield’”’. This adenine riboswitch 
controls gene expression through translational activation’. The class 
of adenine riboswitches has been studied using various methods!””*. 
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Heart, Lung and Blood Institute, Bethesda, Maryland 20892, USA. “Structural Biophysics Laboratory, Basic Science Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer 
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Figure 1 | The PLOR in vitro transcription method. a, Solid-phase 
attachment of DNA templates: 5’-biotin labelled duplex DNA templates were 
coupled to 30-165 jim diameter streptavidin-agarose or neutravidin-agarose 
beads. An 18-nt spacer was inserted between the T7 promoter and the biotin 
moiety to limit steric interferences with polymerase activity. b, Schematic 


The crystal structures of the adenine-bound form of riboA71 
(Protein Data Bank (PDB) accession numbers 1Y26 and 4TZX) show 
loops 1 and 2 ina kissing loop interaction’. Selective labelling of these 
loops would allow study of their structure and dynamics in solution. 
Using PLOR, we synthesized RNAs in which only loop 1 (Lp1-CN) or 
loop 2 (Lp2-CN) was labelled with '°C/'°N isotope so that only signals 
from those regions were observed in heteronuclear correlation spectra 
(Fig. 2b, c). These RNA molecules fold in the same way as those 
synthesized using conventional in vitro transcription, as illustrated 
by superimposing their spectra on that of the fully '°C/'"°N-RNA 
(Extended Data Figs 4 and 5). These samples could be used for detec- 
tion of loop-loop interactions by recording various heteronuclear- 
edited/filtered nuclear Overhauser effect (NOE) spectra. We further 
demonstrated the utility of PLOR by '°C/'°N-labelling both loops 
(Lp1+2-CN) (Fig. 2d), showing that the spectrum of Lp1+2-CN 
is consistent with the superimposition of the Lp1-CN and Lp2-CN 
spectra. 

We also used PLOR to '°C/'°N-label the linkers that connect adja- 
cent duplexes: linker 2 (Lk2-CN) and stem 1 plus linker 1 (S1+Lk1- 
CN) (Fig. 2a). The use of selectively labelled RNAs greatly simplified 
the spectra (Fig. 2e, f), but overlaps were still present, as evidenced by 
the overlapping signals of A4, 5, 7, and 9 in the C8H8 region (Fig. 2f). 
Such overlaps are common for RNA duplexes because of similar chem- 
ical environments around the detected nuclei in the A-type duplex 
conformation. To overcome this, we used PLOR to generate a sample 
(S1+Lk1-H) in which stem 1 and linker 1 were protonated, and the 
rest of the molecule was deuterated (this RNA was not C/!°N- 
labelled). By recording a two-dimensional NOE spectrum of 
S1+LkI1-H, and performing an NOE-walk in the H1’-H6/H8 region 
(Fig. 2g, top panel), the overlaps were partly resolved and proton 
chemical shifts assigned. Without partial deuteration, interpretation 
of this region of the spectrum would be impossible (Fig. 2g, bottom 
panel). With the increasing availability of high-field NMR spectro- 
meters, the use of selectively protonated/deuterated RNA samples is 


Add NTPs __ Gently rotate 
15 min at 37 °C__ extraction (SPE) to 


LETTER 


Wash solid 
phase 


Solid phase 


filter out liquid phase 


* 


\ V/ x3 ) 


Add NTPs Gently rotate SPE Wash solid phase 
10 min at optimal 
temperature 
Cycle 2 -—»> N 
Cycle 1 
Termination 


Add NTPs 


Gently rotate Drop 
12 min at optimal temperature te 
temperature to4°C Collect liquid 
phase 


illustration of the three transcription reaction phases of the PLOR synthesis. 
The number of cycles, N, during the elongation phase depends on the RNA 
sequence and the desired labelling scheme. c, The robotic platform used to 
perform automated PLOR synthesis. 


one of the most effective strategies for RNA structure determination’. 
While all labelling schemes described could, in theory, be performed 
using segmental labelling by ligations, such a method is impractical for 
placing isotope-labelled residues in multiple discrete positions. This 
can, however, be readily achieved using PLOR. Figure 2h shows the 
spectrum of another sample generated by PLOR, 4nt-CN, where only 
residues at positions A21, C38, U39, and G60 were '°C/'°N-labelled. 
RNAs may exist in multiple conformations, and while NMR is 
suitable for detecting such conformers”””’, interpretation of data is 
often complicated by crowded signals. Coexistence of conformers 
can be unambiguously identified by monitoring a single residue at a 
critical structural location. It has been shown” that riboA71 ther- 
modynamic data are best interpreted by a four-state model, and that 
its conformation is affected by ligand and Mg’*. Inspection of the 
crystal structure of riboA71' reveals the U39 N3-H3 imino group 
forming a hydrogen bond with N3 of the adenine ligand (Fig. 3a). 
We used PLOR to incorporate a single 'H/'*C/'°N-labelled uridine-5’- 
triphosphate (UTP) at this position (U39-CN). In the absence of aden- 
ine we detected multiple weak cross-peaks for both the H6—C6 and the 
N3-H3 imino groups in the °C and '°N heteronuclear single- 
quantum correlation signals of U39, respectively, indicative of multiple 
conformations (Fig. 3b, c). At a 1:1 RNA:adenine ratio we detected 
four distinct cross-peaks (Fig. 3b, c), implying coexistence of at least 
four conformations of the adenine-binding pocket, in agreement with 
the four-state model”®. This type of information would be difficult to 
obtain unambiguously using a uniformly labelled sample, where 
extensive overlap and cross-peaks from other residues would obscure 
the weak cross-peaks associated with alternative conformations of the 
complex. Nevertheless, it is noteworthy to point out that a slightly 
different multi-state model has also been proposed on the basis of a 
study of an adenine riboswitch aptamer domain of a slightly different 
sequence using NMR and a uniformly labelled RNA sample”. 
Single-molecule FRET (smFRET) is a powerful method for studying 
conformation change and has been used to study the adenine 


18 JUNE 2015 | VOL 522 | NATURE | 369 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


a ie 134.0 {b Lp1}ic Lp2 ||d Lp1+2 
e . e 
ca +f ij G25 a6 O78 G25 go 46 
p ucy p2 
Bi 35 U 40 = a 
G 30 C45 E M21. & RAN. .aifas 
U- ~ UUUGGGA Ade CAAGAG “i ¢ e A54 ASA 
A led ITT LEl & * C48 cag 
y ,UAAUGCU GUUCUC, -A A ‘ 
G 1 A AY 55°. A 2 U22 ASS * Gag @ | [A53 cag . 
ae Lki Ay U ®. Us1 ust 
10 A~U Ade 
U~A 142.04 &U24 Ade? @ 50 U4—e © 
A—U65 C6H6/C8H8 U50 
Gc S 
—5A—U 
O Au 
A54 
ae 152.0 . 123 
G-c 52.07 ade ASS 7 
G-c a2 oT aae ASS 
: 1 
70 80 7.0 
oe 
618 au, 2 |[t 256 S1+Lk1 134.04 ant 
: G60 
: G2 Pa 
. A40 Gt pgs A45,7,9 
AY A21 
“| use “* U8 rt 
- re uso. 
2 C38 : + C38 
Mii ° Ut0 
* C42 
Ad 
142.0 Ader © 86 oe 142.0, © A%° 
3} Al2 
AQ” 
AZ pad 
fA 
152.05 ade A4o |} ati 2 Ale ‘| 152.04 Ade 
e * e Ade 2A21 
T i T T T T T 
8.0 7.0 8.0 7.0 6.0 8.0 7.0 


Figure 2 | PLOR isotopical labelling of a 71-nt RNA. a, The secondary 
structure of riboA71. The colour-coded sequence depicts the various regions of 
the RNA that were selectively labelled with '°C/'°N or 7H NTPs. PLOR was also 
used to label residues at four discrete positions (green asterisks), or a single 
residue (black arrow). b-f, h, Aromatic regions of 1H3C-transverse relaxation 
optimized spectroscopy (TROSY) spectra of samples with !°C/1°N labelling in 


(b) loop1 (Lp1), (c) loop2 (Lp2), (d) Lp1 plus Lp2 (Lp1+2), (e) linker2 (Lk2), 
(f) stem 1 plus linker 1 (S1+Lk1), or at (h) four discrete positions (4nt). 

g, Comparison of 'H NOE spectra. The bottom spectrum was recorded using a 
fully protonated sample; the top spectrum shows the same region recorded 
using a sample in which only the stem 1 and Lk1 were protonated. 
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Figure 3 | Application of PLOR in dissecting the co-existence of multiple 
conformations using a singly labelled RNA sample. a, Illustration of the 
hydrogen bond formed between the adenine ligand and U39 in riboA71 (PDB 
accession number 1Y26). b, ‘H'°C-TROSY and c, ‘'H!=N-TROSY spectra 


riboswitch?***. We illustrate that PLOR can be used to incorporate 
modified nucleotides tolerated by T7 RNAP. The crystal structure of 
adenine-bound riboA71 shows loops 1 and 2 in a kissing-loop inter- 
action, but there is no direct information on loop-loop interactions in 
the absence of adenine. We used PLOR to generate U24Cy3-C55Cy5- 
B RNA, which contains a 3’-biotin and nucleotides tagged with fluor- 
escent Cy3 at U24 (loop 1), and Cy5 at either U65 (stem 1) or C55 
(loop 2; Cy5 was placed at the end, rather than in the middle, of loop 2 
to avoid steric interference with respect to loop—loop interactions; 
Fig. 4a). smFRET measurements with this RNA in 0 1M adenine yield 
an Eprer histogram (Fig. 4b) that can be approximated as the sum of 
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recorded for '°C/"°N-singly-labelled (U39) riboA71 titrated with adenine. Four 
cross-peaks in both H6-C6 (a-d) and imino (a’-d’) regions are indicative of 
four distinct conformations at a 1:1 RNA:adenine ratio. 


two broad Gaussian curves, indicating the presence of at least two 
populations of molecules with distinct loop-loop interactions. 
Approximately 20% of the population is in a high-FRET (~0.92) con- 
formation consistent with the U24-C55 distance (<20 A) observed in 
the crystal structure. Upon addition of ligand, this conformation 
becomes increasingly populated (Fig. 4b, c). To further evaluate the 
utility of PLOR for incorporation of multiple modifications at specific 
sites, we imaged single U24Cy3-C55Cy5-B molecules (Fig. 4d), and 
not only confirmed the presence of the three RNA modifications, but 
also showed from the individual time traces (Fig. 4e) that adenine is not 
required to form the high-FRET conformation”*”*. 
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Figure 4 | Application of PLOR-generated constructs for smFRET. 

a, Secondary-structure diagram of riboA71 depicting the locations of the 
fluorescent labels. b, smFRET probability histogram for free diffusion 
experiments conducted in solutions containing 1 mM Mg’* and either 0 or 
100 LM adenine. About 1,000 fluorescent bursts were analysed for each 


PLOR allows efficient synthesis of milligram quantities of RNA 
(Figs 2 and 3, Extended Data Fig. 6 and Extended Data Table 2). 
Both the His-tagged T7 RNAP and the DNA templates are reusable, 
and the same batch of bead-attached DNA templates was used for all 
samples generated here. The DNA templates can be produced routi- 
nely and economically in micromole quantities by PCR”; all reagents 
are commercially available, and the technology is automated using a 
robotic platform. Extension of PLOR for other applications is easily 
envisioned. For example, variants of T7 RNAP have been developed 
that allow incorporation of nucleotides with modified ribose groups 
into RNA which can, among other useful characteristics, provide res- 
istance to RNases for in vivo applications®*°. PLOR may also be useful 
for X-ray crystallography by incorporating heavy-atom-derivatized 
nucleotides into the RNA at one or more desired positions for phase 
determination of X-ray diffraction data. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. 

DNA templates. Initial DNA templates used for riboA71 synthesis by the PLOR 
method were purchased (Avetra Bioscience). The 5’-biotinylated non-coding 
strand, 5’-TCTGATTCAGCTAGTCCATAATACGACTCACTATAGGGAAG 
ATATAATCCTAATGATATGGTTTGGGAGTTTCTACCAAGAGCCTTAAA 
CTCTTGATTATCTTCCC-3’, was annealed to its complementary strand (the 
coding strand) in buffer A (40 mM Tris-HCl, 100 mM K,SO,, 6 mM MgSO,, pH 
8.0). An 18-nt spacer before the T7 RNAP promoter (underlined) was inserted to 
reduce crowding on the bead surface and potential steric hindrance to the T7 
RNAP. Double-stranded DNA templates were also generated by touchdown PCR 
(TD-PCR) to abrogate nonspecific products*'. The commercially synthesized DNA 
strands were used as templates in the TD-PCR. The forward primer contained 
biotin at its 5'-end and the reverse primer contained two 2'-O-methylguanosine 
(mG) at its 5’ end to reduce non-templated nucleotide addition to transcripts”. 
The primer sequences were: 5’-/Bio/TCTGATTCAGCTAGTCCATAATAC 
GAC (Integrated DNA Technologies), and 5’-mGmGGAAGATAATCAA 
GAGT TTAAGGCTC (Integrated DNA Technologies). The reagents used in 
the TD-PCR reactions were as follows: 0.1 4M template, 10 14M primers, 200 
1M dNTPs, Taq DNA polymerase, and PCR buffer (50 mM KCl, 10 mM Tris- 
HCl, 2 mM MgCh, pH 8.4). The cycling program of the TD-PCR involved two 
separate phases: the touchdown phase and the PCR phase. During the touchdown 
phase, the annealing temperature was reduced from 75 °C to 45 °C over 20 
successive cycles. Each touchdown cycle, therefore, began with melting at 
95 °C for 30 s (5 min for first cycle), followed by annealing (temperature varies) 
for 45 s and elongation at 72 °C for 1 min. The PCR phase was a generic amp- 
lification stage of 25 cycles of 95 °C for 30 s, annealing at 50 °C for 30 s, and 
elongation at 72 °C for 1 min. 

The DNA, 5'-TCTGATTCAGCTAGTCCATAATACGACTCACTATAGGGG 
ACGGCGGCAGCGCTGTCTAGCTGCGGGCATTAGACTGGAAAACTAGT 
GCTCTTTGGGTAACCACTAAAATCCCGAAAGGGTGGGCTGTGGTGAC 
CCTCCG-3’, and its complementary strands were used as templates in TD-PCR 
for turnip crinkle virus (TCV) samples. The sequences of the primers used in the 
TD-PCR for TCV samples were, 5'-/Bio/TCTGATTCAGCTAGTCCATAA 
TACGACTCACTA TA, and 5’-CmGGAGGGTCACCACAGC. The templates 
and primers were obtained from Integrated DNA Technologies. 

Solid-phase attachment of DNA templates. For this study, commercial agarose 
beads (30-165 ttm diameter), coated with either streptavidin or neutravidin 
(Thermo Fisher Scientific), were used as the solid-phase support. In our hands, 
the neutravidin-coated agarose beads appeared to have a better retention of the 
biotinylated DNA templates and, therefore, were used as the solid-phase support 
in most cases. The double-stranded DNA template (0.8 jumol) was incubated with 
neutravidin-coated agarose beads (8 ml) in buffer A for ~3 days at 4 °C. The bead- 
attached templates were washed by repeated rinsing and passing through a Pierce 
Centrifuge Column with a ~30 um average pore size (Thermo) to remove non- 
bound DNA templates, and were stored at 4 °C. Approximately 80% of the DNA 
template was attached to the beads on the basis of ultraviolet-absorbance detec- 
tion. The bead-attached templates were stable for an extended period, and were 
reused for multiple rounds of synthesis for this study. 

Detailed experimental procedure for PLOR. A total of eight RiboA71 samples 
(Fig. 2a) with various isotopic labelling schemes were prepared using PLOR. The 
detailed recipes for each of the eight syntheses are provided in Extended Data 
Table 1, including the amounts of NTPs, which are given in molar ratios relative to 
the total amount of the DNA templates used in each cycle of synthesis. The NTP 
ratios were chosen on the basis of the number of each specific type of nucleotide in 
an individual segment synthesized in a particular cycle. Note that much higher 
ratios of NTPs were used in the initiation phase (Extended Data Table 1). This is 
because of the intrinsic nature of the T7 RNAP-catalysed reaction, where a large 
amount of abortive products are generated and relatively large amounts of NTPs 
are required to complete the slow initiation phase of the reaction*’. The 96-fold 
ratios of ATP and GTP, and 9.6-fold ratio of UTP for initiation, were optimized 
specifically for the adenine riboswitch RNA samples on the basis of its residue type 
composition. Here we use the synthesis of the '*C/'°N-labelled Lp1-CN as an 
example to illustrate the PLOR procedure. T7 RNAP was mixed with DNA- 
attached agarose beads at 37 °C for 10 min, followed by addition of ATP, GTP, 
and UTP and incubation at 37 °C for 15 min with gentle rotation of the reaction 
vessels. The gentle mixing, as opposed to vortexing, was necessary to prevent 
formation of foam/bubbles by T7 in the reaction vessel. The total reaction volume 
was 40 ml and the concentrations of T7, DNA, ATP, GTP, and UTP were 20 pM, 
20 uM, 1.92 mM, 1.92 mM, and 192 uM, respectively, in buffer B (40 mM Tris- 
HCl, 100 mM K,SO,, 6 mM MgSO,, 10 mM DTT, pH 8.0). The reaction mixture 
was filtered and rinsed five times using buffer C (40 mM Tris-HCl, 6 mM MgSO,, 
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pH 8.0) (Extended Data Fig. 1). In the first elongation cycle (Extended Data Fig. 1, 
blue), 40 uM ATP, CTP, and UTP, dissolved in buffer D (40 mM Tris-HCl, 6 mM 
MgSO,, 10 mM DTT, pH 8.0), were added and the reaction was allowed to proceed 
for 10 min at 25 °C, followed by filtration and washing three times with buffer 
C. The exact concentrations of NTPs used for the remaining elongation cycles 
were as follows: 20 uM '°C/'°N-ATP and 20 iM '3C/'°N-GTP (cycle 2); 20 uM 
C/°N-ATP and 40 uM °C/N-UTP (cycle 3);40 1M '°C/"°N-GTP (cycle 4). In 
the termination step (Extended Data Fig. 1, red), 200 14M ATP, 220 uM CTP, 140 
uM GTP, and 340 uM UTP were added to the reactor and incubated at 25 °C for 
~12 min. The filtered RNA product was then collected and stored at 4 °C or 
—20 °C for long-term storage, and the DNA-attached agarose beads were rinsed at 
least five times and stored at 4 °C for future use. The final RNA products were gel- 
purified using 15% polyacrylamide gel electrophoresis (PAGE)-urea gels before 
being used for NMR data collection. 

The procedure of preparing the "N-H1-TCV* ("°N-labelled at 5’-end, shown 
in red in Extended Data Fig. 6a) is the same as described earlier. The total reaction 
volume used in the "N-H1-TCV was 30 ml, and the concentrations of T7, DNA, 
1SN-ATP, '°N-GTP, and '°N-CTP in the initiation were 20 1M, 20 1M, 0.64 mM, 
4.8 mM, and 0.32 mM, respectively. In the first elongation cycle, 20 1M ISN-CTP, 
20 1M '°N-GTP and 60 uM !°N-UTP were added in the reaction. In the termina- 
tion step, 0.4 mM ATP, 0.380 mM CTP, 0.5 mM GTP, and 0.360 mM UTP 
were added. 

NMR experiments. All NMR samples in this study were in a buffer containing 10 
mM KH,PO,, 30 mM KCl, 2 mM MgCl, pH 6.8. Adenine (5 mM) was added to all 
riboA71 NMR samples except U39-CN. Two-dimensional H'H NOESY, 'H'°N 
TROSY°5, and ‘HC TROSY spectra were collected at 25 °C for riboA71 and 15°C 
for the 104-nt TCV RNA on Bruker spectrometers operating at a proton frequency 
of 850 MHz or 700 MHz and equipped with a triple-resonance cryo-probe. All 
spectra were processed and analysed with nmrDraw** and NMR View] (One 
Moon Scientific). The concentrations for Lp1-CN, Lp2-CN, Lp1+2-CN, Lk2- 
CN, $1+Lk1-CN, 4-nt-CN, $1+Lk1-H, U39-CN, 71-nt-CN, '°N-H1-TCV, and 
1SN-104-nt-TCV (fully °N-labelled TCV) were 0.5, 0.3, 0.2, 0.35, 0.4, 0.3, 0.4, 0.35, 
0.5, 0.4, and 0.5 mM, respectively and the sample volumes were ~250 tl in 
Shigemi tubes. 

Fluorescent samples generated by PLOR. Five fluorescent samples were pro- 
duced by the PLOR method: U24Cy3-C55Cy5, U24Cy3-C55Cy5-B, U24Cy3- 
U65Cy5, U24Cy3-U65Cy5-B and U24A555-U65488-B (Extended Data Table 3). 
As an example, a detailed description of the U24Cy3-C55Cy5 synthesis is pro- 
vided. DNA and T7 concentrations used in the preparation were 5 1M in 3 ml 
transcription buffer (buffer B). Then, NTPs were added as follows: 0.48 mM ATP, 
0.48 mM GTP and 48 uM UTP (initiation); 10 14M ATP, 10 tM CTP and 10 pM 
UTP (elongation, cycle 1);5 1M ATP and 5 1M GTP (cycle 2); 5 uM UTP (cycle 3); 
5 uM ATP and 5 uM 5-aminoallyl UTP (TriLink) (cycle 4); 5 uM ATP, 30 pM 
GTP, and 30 uM UTP (cycle 5); 15 uM ATP, 15 uM CTP, and 5 tM UTP (cycle 6); 
5 uM ATP and 10 1M GTP (cycle 7); 10 uM CTP and 10 [tM UTP (cycle 8); 15 pM 
ATP and 5 uM Cy5-CTP (cycle 9, at 37 °C); 5 uM CTP, 5 uM GTP, and 15 uM 
UTP (cycle 10); 10 pM ATP, 20 1M CTP, and 25 uM UTP (termination). Cy5- 
CTP was from GE Healthcare. The RNA transcript was lyophilized and then 
dissolved in double-distilled HO. U24, which contained an aminoallyl group, 
was functionalized with Cy3 by adding a Cy3-NHS ester and 0.5 volumes of 0.3 
M sodium bicarbonate buffer (pH 8.3), followed by incubation for 1 h at 37 °C, 12 
h at 25 °C, and then 1 h at 37 °C. All incubations were performed in the dark. 
Homogeneity of dually labelled RiboA71 samples was assessed via 15% denaturing 
PAGE and analytical high-performance liquid chromatography before being used 
in free-diffusion smFRET experiments. 

smFRET experimental set-up. A home-built microscope system was used to 
acquire all of the single-molecule fluorescence data, as recently described else- 
where”. Briefly, alternating excitation*® of the donor (Cy3) and acceptor (Cy5) 
fluorophores is achieved using pulsed-laser sources at 532 nm (20 MHz, 10 ps) and 
635 nm (20 MHz, 90 ps), respectively. The interleaved pulses are directed into the 
back aperture of a 1.2 numerical aperture water objective, which is used in a 
standard epifluorescence configuration, both to focus the excitation source down 
to a ~250 nm diffraction limited spot and to collect the emitted fluorescence 
photons. The collected fluorescence is then directed out of the side-port of the 
microscope and focused through a 50 um confocal pinhole, which spatially defines 
the detection volume. The photons are then separated by both colour (donor, 
acceptor) and polarization (horizontal, vertical) before being directed onto one 
of four single-photon avalanche photodiodes, where the arrival time of each 
detected photon is recorded. 

Free-diffusion smFRET. After being deemed suitable, the fluorescently labelled 
RNA samples were diluted to a final concentration of ~100 pM in single- 
molecule imaging buffer (10 mM HEPES, 50 mM NaCl, 1 mM Mg**, 0-1 mM 
Ade, 2 mM Trolox, 5 mM 3,4-protocatechuic acid (PCA), 100 nM protocatechuate 
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dioxygenase (PCD), pH 7.5). The PCA/PCD oxygen” and the blinking suppres- 
sant, TROLOX”, were used together to enhance the photostability of cyanine- 
derived fluorophores (that is, Cy3 and Cy5). 

Surface-immobilized smFRET. The selectively incorporated biotin moiety per- 
mits surface immobilization of individual fluorescently labelled nucleic acids, as 
recently described elsewhere’. Briefly, the piezoelectric scanning stage of the 
experimental apparatus provides the ability to raster-scan an area of the sample 
(for example 15 jum X 15 jm), resulting in a false-colour image of the surface. 
The stage can then be used to position an individual fluorescence feature 
within the focus of the microscope objective, allowing continuous excitation 
and collection of fluorescence from a single molecule. FRET values are derived 
from the ratio of acceptor (A) fluorescence to the sum of both donor (D) and 
acceptor (A) fluorescence intensities (that is, A/[D + A]). These fluorescence 
intensities are background corrected and adjusted for various experimental arte- 
facts such as acceptor direct excitation, donor cross-talk on the acceptor channel, 
and differential quantum yields of the two fluorophores". 

Crystallographic analysis. Unmodified RiboA71 RNA synthesized using PLOR 
(Extended Data Table 1) was crystallized as previously described'”*. Briefly, for 
crystallization by vapour diffusion, a solution containing 700 tM RNA, 50 mM 
KOAc pH 6.8, 100 mM MgCl, 1 mM spermine, and 5 mM adenine was mixed 1:1 
(v/v) with a reservoir solution comprising 50 mM Tris-HCl pH 8.5, 100 mM KCl, 
10 mM MgC, and 30% polyethylene glycol (PEG) 400. Crystals grew to maximal 
dimensions of 250 jm X 50 jm X 50 jim in 1-2 weeks at 4 °C and were directly 
flash-frozen by plunging into liquid nitrogen. The structure of the PLOR-gener- 
ated riboswitch-adenine complex was solved by molecular replacement with 
PHASER using a published structure*® (PDB accession number 4TZX) as a search 
model. Initial solutions were subjected to manual rebuilding” interspersed with 
iterative rounds of rigid-body, simulated-annealing, and individual isotropic 
B-factor refinement using PHENIX”. Refinement statistics are summarized in 
Extended Data Table 4. The coordinates and structure factors have been deposited 
in Protein Data Bank under accession number 4XNR. 

Optimizing the transcription reaction. Fundamental to our method of the pre- 
parative T7-catalysed reactions is a novel stoichiometry concentration of reac- 
tants. This novel scheme is counter-intuitive to the conventional in vitro 
transcription, where multiple millimolar concentrations of NTPs are used. 
Various reaction conditions, including buffer conditions, NTP concentrations, 
reaction time, incubation temperatures, and T7 RNAP concentrations were tested 
in Lp2-CN synthesis to optimize the yields. Among all factors, the effects of NTP 
and Mg’* concentrations on PLOR yields are distinctly different from those of 
standard solution-based in vitro transcription. Since the initiation phase is the least 
efficient of the three phases in T7 RNAP in vitro transcription, various conditions 
were tested in an attempt to improve the initiation efficiencies. We found that 
gentle rotation of the mixture of T7 RNAP and DNA-attached beads at 37 °C for 
10 min, followed by addition of NTPs dissolved in the transcription buffer and 
further incubation at 37 °C for 15 min, resulted in a 20% increase of the final 
product compared with incubating all reagents simultaneously at 37 °C for 15 min. 
Incubating the reaction at 37 °C gave higher yields than incubating at 25 °C, 35 °C, 
or 40 °C, as expected. Incubating the initiation reaction mixture at 37 °C for 10-20 
min was sufficient, but incubation for >30 min resulted in decreased yields by as 
much as 50%. Types of anion had little effect on the reaction yield. KCI/MgCl, and 
K,SO,/MgSO, were tested and the product yields were very similar (Extended 
Data Fig. 3b, panel II). Mg** concentration, however, affected yields dramatically 
and was optimal at 6 mM (Extended Data Fig. 3b, panel III). This differs from the 
standard transcription reaction where 20-26 mM Mg’ is optimal. A series of 
K,SO, concentrations (0, 20, 40, 60, 80, and 100 mM) was tested. Yields were 
similar when K,SO, concentrations were between 20 and 100 mM. Transcription 
initiated with 100 mM K3SO, had a higher yield than having no K,SO, present 
during initiation (Extended Data Fig. 3b, panel IV). A series of different values of 
pH of the initiation buffer (4.0, 7.0, 7.5, 8.0, and 9.0) was also tested. As expected, 
the optimal pH for the initiation was between 7.5 and 8.0. We tested the impact of 
NTP concentrations on yields. The initiation phase requires a substantially higher 
NTP concentration (~2 mM) than the elongation phase, but this concentration is 
still relatively low compared with the standard transcription reactions where NTP 
concentrations are typically 4-6 mM. The optimal NTP concentrations for elonga- 
tion using PLOR are in the micromolar range (Extended Data Table 1). This is 
advantageous as it means that the expensive labelled NTPs, which are used only 
during elongation, can be used at low concentrations to achieve a high percentage 
of their incorporation into the RNA. Higher NTP concentrations during elonga- 
tion resulted in substantially lower yields (Extended Data Fig. 3b, panel V). The 
ratio of T7RNAP:DNA was tested for its effect on yield. The yield at 1:1 was similar 
to that at 2:1 and higher than that at 0.5:1. Higher ratios resulted in lower yields. 
The effect of reaction time in the termination step on yield is shown in Extended 
Data Fig. 3b, panel VIII. 


Quantification of experimental yields. The experimental yields were measured 
by quantifying gel band intensities. Transcription reaction (0.2-1 tl) was loaded 
onto a 12% mini-PAGE gel. For quantification, 0.5, 1.0, and 1.5 pmol of pure 
RiboA71 samples were run as a standard. The gels were stained by Sybr Gold 
(Life Technologies) for 10 s, and then visualized using a Gel Doc’ EZ imager 
(Bio-Rad). The stained gel band intensities were quantified using Image Lab 4.1 
(Bio-Rad), and were used to determine the experimental yield of each synthesis 
product, given as a percentage relative to the amount of template used in the 
synthesis (Extended Data Table 2). 

Estimating synthesis efficiencies. The transcription yields depend on both the 
reaction cycle efficiencies and the total number of cycles in the synthesis. For 
simplification, the average termination efficiency was taken as equal to the average 
elongation efficiency. The apparent efficiencies of initiation and elongation, there- 
fore, were estimated using the equation 


Yield=I x E” (1) 


where I and E are the percentage efficiencies of initiation and elongation, respect- 
ively, and n is the total number of cycles used for synthesis. As there are two 
unknown variables, J and E, a second equation was needed. Because the same 
procedures, equipment, and reagents were used to make each NMR sample, 
one would expect similar values for I and E among different samples. Therefore, 
we used a crude method to calculate the efficiencies by setting I and E of 
one sample equal to the I and E of another ‘reference’ sample, and then solving 
for each variable. The same reference sample, $1+Lk1-CN, was used in each 
calculation. For example, the Lp1-CN sample was generated using a five-cycle 
synthesis with an experimental yield of 30.3%. Similarly, S$1+Lk1-CN was gener- 
ated using a one-cycle synthesis with a yield of 40.3%. Using equation (1), then, I 
and E for Lp1-CN were estimated as 43.3% and 93.1%, respectively. Using this 
method for all RiboA71 samples, the average values for I and E were 46.0% and 
87.8%, respectively (Extended Data Table 2). The ‘theoretical’ yields for each 
sample were calculated using their individual efficiencies. 

Discussion of PLOR efficiency. The efficiency of the PLOR method depends 
heavily on a number of factors, such as the number of cycles involved in achieving 
a desired labelling scheme (equation (1)), the inherent initiation and elongation 
efficiencies for a given RNA sequence, the purity of NTPs, the efficiency of the 
washing/filtration cycles, the T7 RNAP quality, and the efficiency of DNA-bead 
attachment. For any given commercially bought NTP, the presence of other NTPs 
can be as high as 5%. Moreover, it is expected that some degree of cross-contam- 
ination among nucleotides occurs during synthesis as a result of carryover from 
one reaction cycle to the next owing to inefficient washing/filtration during solid- 
phase extraction. Such impurity and cross-contamination in the reaction mixture 
in any given cycle may result in non-synchronized transcription, where initiation 
or elongation may pause at non-designated positions, depending on the availabil- 
ity of various NTPs. In addition, sufficient quantities of all four NTPs due to 
contamination may lead to full-length transcripts prematurely (that is, before 
the designed termination phase) as well as re-initiation of transcription by recycled 
T7, again resulting in non-synchronized transcription. During the initiation and 
elongation phases, abortive and full-length transcripts generated as a result of non- 
synchronized transcription are removed during the washing/filtration cycles and 
thus do not affect the purity of the final full-length products collected at the end of 
the termination phase. However, highly purified NTPs and a more robust solid- 
phase extraction may alleviate non-synchronized transcription, thereby increasing 
the yield of the correctly labelled, full-length product. Leaking of DNA templates 
from the agarose beads may also result in lower transcription yields. The leaking is 
possibly due to a high off-rate of biotin binding to streptavidin beads and may be 
mitigated by using glass beads as the solid phase or using mutant streptavidin- 
agarose beads with a much reduced off-rate*’. 

Since it is an enzyme-catalysed reaction, there are several inherent limitations. 
In principle, for a given RNA sequence, the yield of the PLOR method is directly 
correlated to the yield of the regular in vitro transcription synthesis. The specific 
requirement for PLOR is that only three types of residue are allowed in the first 13 
residues. This would probably limit the ability to label residues in the first 13 nt in 
the 5’ end of a sequence. 

Automated RNA synthesis platform. Since the PLOR process consists of repet- 
itive steps and cycles, we designed and constructed an automated platform to 
perform the synthesis. This prototype RNA synthesizer (Extended Data Fig. 2) 
utilizes the modular liquid delivery system made by Zinsser Analytic, and is the 
first instrument to perform automated RNA synthesis using PLOR. The platform 
consists of several modules, including a rotating reaction chamber, a solid-phase 
extraction cube (SPEC), a pair of robotic arms, reaction vessels, four long-arm 
liquid transfer tips and four syringes, a six-way valve for liquid handling, and seven 
stations. The reaction chamber is located at the incubation station (I-station), 
where the reaction vessels containing reagents incubate at either 37 °C (initiation) 
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or at 25 °C (elongation or termination). There are two reagent-stations (R-sta- 
tions): one attached to a cooling unit to keep reagents at 4 °C, and another at room 
temperature. The reagent-addition/filter station (RF-station) has two main pur- 
poses: (1) it is where the reaction reagents are transferred to the reaction vessels, 
and (2) it is where solid-phase extraction is performed through the addition of 
buffers and nitrogen gas, and is connected to a vacuum pump for the removal of 
liquid waste. The wash station (W-station) washes the tips/syringes between liquid 
transfers. The reaction chamber cover and the SPEC are docked at the cover 
station (C-station) and SPEC-station, respectively, when not in use. The operation 
for performing the PLOR RNA synthesis is controlled by a software program with 
customizable input files. Certain types of information require user input to the 
computer based on the individual protocol, such as volumes of the various NTPs, 
for each cycle, to be delivered into the reaction vessels, and the number of reaction 
cycles. Other variables that may be changed less frequently are set as default, 
including the number of wash cycles, the buffer volumes used in filtration, incuba- 
tion times and temperatures, the rotation rate of the reaction chamber, the move- 
ment of robotic arms, SPEC pressure and time, etc. The protocols are divided 
into blocks according to their purpose, and the instrument combines the 
blocks flexibly. 

Here we describe the blocks for the automated synthesis of the Lp1-CN sample. 

Pre-incubation (initiation). After manual addition of the template-attached 
beads and T7 RNAP to the reaction vessels, which are housed in the reaction 
chamber at the I-station, the robotic arms seal the reaction chamber with the cover, 
and the chamber temperature is ramped to 37 °C with gentle rotation. After 10 
min, the robotic arms remove the reactor cover and transfer it to the C-station; 
they then transfer the reaction vessels to the RF-station. 

Reagent addition (initiation). The liquid-transfer system transfers the user- 
specified amounts of ATP, GTP, UTP, and transcription buffer individually from 
their respective stock-solution containers from the R-station (a cooling unit is used 
to keep all stock solutions and the transcription buffer at 4 °C) to the reaction 
vessels in the RF-station. The syringe tips are then rinsed three times with wash 
buffer at the W-station. 

Incubation (initiation). The robotic arms return and seal the cover on the 
reaction chamber, and the reaction chamber begins gentle rotation at 37 °C for 
15 min. Once the rotation stops, the robotic arms move the cover to the C-station 
and then the reaction vessels to the RF-station. The temperature of the reactor 
chamber drops back to 25 °C. 

Solid-phase extraction (SPE) (initiation). The robotic arm attaches the SPEC to 
the top of the reaction vessels, one at a time, at the RF-station. A nitrogen flow at 80 
p.s.i. is applied to the SPEC for 25 s to remove the liquid phase from the reaction 
vessels. The robotic arm then returns the SPEC to the SPEC-station. 

Rinsing (initiation). Wash buffer is added to the reaction vessels by the liquid- 
transfer system, and the tips are then rinsed three times at the W-station. The 
reaction vessels are placed in the reactor chamber by the robotic arm, rotated for 
~1 min, and then returned to the RF-station. 

The SPE (initiation) and rinsing (initiation) blocks are repeated five times, 
followed by one more SPE (initiation) block before proceeding to the elonga- 
tion steps. 
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Reagent addition (elongation cycle 1). This block is the same as the reagent 
addition (initiation) block except that the NTPs added are ATP, CTP, and UTP 
(see Extended Data Table 1). 

Incubation (elongation cycle 1). This block is the same as the incubation (ini- 
tiation) block except that the incubation is performed at 25 °C for 10 min. 

SPE (elongation cycle 1) and rinsing (elongation cycle 1). These blocks are the 
same as SPE (initiation) and rinsing (initiation) blocks, respectively. 
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the SPE (initiation) and rinsing (initiation) blocks, respectively, except that during 
SPE in the termination step, the liquid removed from the reaction vessels is 
collected in sample containers rather than being pumped to the waste container. 
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Initiation 


Addition of initiation components to the reaction vessels 


Gentle mixing for 15min at 37°C 


Solid-phase extraction (SPE) : 
Washing of ternary complex | “== Repeat 5 times 


Elongation 


Addition of select rNTPs according to the desired labeling scheme 


Gentle mixing for 10 min at 25 °C or optimal temperature 


SPE 


Washing of ternary complex Repeat stmes 


Termination 


Final addition of select NTPs 
Gentle mixing for 12 min at 25 °C 


Drop temperature to 4°C 


SPE and collect the liquid phase containing labeled RNA transcript 
Extensive washing and storage of solid-phase for future use 


Extended Data Figure 1 | Synthesis algorithm for the selective labelling of _ this algorithm. The initiation, elongation, and termination stages are shown in 
RNA using the PLOR method. All samples except for fully labelled 71-nt-CN, green, blue, and red, respectively. Various NTP combinations added during the 
S1+LkI1-CN, $1+Lk1-H, and °N-104-nt-TCV were synthesized following elongation cycles depend on the desired labelling scheme. 
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Extended Data Figure 2 | Automated platform for PLOR synthesis. parts and stations. b, Top-view (left) and side-view (right) photographs of the 
a, Diagram of the automated platform for PLOR synthesis depicting its various in-house automated platform. 
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Extended Data Figure 3 | Optimization of experimental conditions for 
PLOR synthesis. a, The products of the PLOR (left) and the standard 
transcription methods (right). PLOR generates a pure full-length product with 
the desired labelling in the final step. b, Comparison of PLOR efficiency for 
Lp2-CN synthesis under various conditions: I, freshness of DNA-attached 


beads; II, anion specificity; III, [Mg*]; IV, K,SO, presence in the initiation; V, 
increasing NTPs (‘1X represents NTP amounts in Extended Data Table 1); VI, 
ratio of ATP/GTP:DNA; VIL, ratio of UTP:DNA; and VIII, incubation time of 
the termination. The right lane contains pure RiboA71 as a control. 
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Extended Data Figure 4 | Comparison of NMR spectra of PLOR-generated the RNAs synthesized by PLOR have the same fold as 71-nt-CN generated 
and 71-nt-CN samples in adenine-bound form. Superposition of the 'H'°C- _ using the traditional solution-based transcription method and are functional as 
TROSY spectra of 71-nt-CN with Lp1-CN (a), Lp2-CN (b), Lp1+2-CN evidenced by binding of the adenine ligand. 

(c), Lk2-CN (d), $1+Lk1-CN (e), and 4nt-CN (f). These results indicate that 
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Extended Data Figure 5 | PLOR-generated riboA71 maintains both 
sequence and structural fidelity. a, Structural superposition of the PLOR- 
generated riboA71 (PDB accession number 4XNR) with that of the riboA71 
generated using the regular in vitro transcription (PDB accession number _ 
4TZX)’*. The root mean squared deviation between all C1 atoms was ~ 0.3 A. 
b, Structural superposition of the PLOR-generated riboA71 (PDB accession 
number 4XNR) with that of the riboA71 (PDB accession number 1Y26)'. The 
root mean squared deviation between all C1 atoms was ~ 0.4 A. c, Sequences 
and secondary structures of the RNAs in b. The arrows denote nucleotide 
sequence differences between the two ribA71 sequences. d, Composite 


simulated anneal-omit 2|F,| — |F.| electron density of the riboA71 RNA 
structure (PDB accession number 4XNR) at 2.2 A resolution calculated using 
the final model (1.0 s.d.). e, Portion of the electron density in c in the adenine- 
binding pocket unambiguously identifies the nucleobase identities of the 
PLOR-generated RNA, revealing no undesired sequence changes introduced by 
PLOR. f, Portion of the electron density in c of the G6*C66 base pair, which 
differs in identity between the two structures, suggesting that if there had been 
undesired sequence substitutions that resulted from using the PLOR method, 
they would have been readily detected in the crystallographic analysis. 
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Extended Data Figure 6 | Using PLOR to isotopically label a 104-nt RNA. '°N-H1-TCV RNA. ¢, Superposition of the 'H!°N-TROSY spectra of !"N-H1- 
a, Secondary structure of the 104-nt structural element in the TCV genomic TCV with °N-104-nt-TCV“4, indicating that the PLOR-generated selectively 
RNA. The °N-H1-TCV was synthesized by the PLOR method, and the labelled RNA has the same fold as that generated using the regular in vitro 
sequence in red is ‘°N-labelled in '"N-H1-TCV. b, '°N-TROSY spectrum of transcription. 
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Extended Data Table 1 | PLOR recipes for the RiboA71 and TCV samples 


NTP addition Cona, 
(NTP coefficients represent the multiples of template) vol.” 
Initiation: 9SATP, 96GTP, 9.6UTP; Elongation (cycle 1): 2ATP, 2CTP, 2UTP; (cycle 2): °C™N-ATP, ri ag 
*3¢'5N-GTP; (cycle 3): °C'®N-ATP, 2'°C'°N-UTP; (cycle 4): 2'°C'°N-GTP; Termination (cycle 5): 10ATP, 
11CTP, 7GTP, 17UTP 


Lp2-CN Elongation (cycle 2): 3ATP, 7GTP, 8UTP; (cycle 3): 3ATP, 3CTP, UTP; (cycle 4): ATP, 2GTP; (cycle 5): 20 uM, 
2'°c'®N-CTP, 2'°C'’N-UTP; (cycle 6): 3'°c’N-ATP; (cycle 7): 2CTP, GTP, 3UTP; Termination (cycle 8): 50 mL 
2ATP, 4CTP, 5UTP 


Lp1+2-CN | Elongation (cycle 2): "C'®N-ATP, “C'SN-GTP; (cycle 3): °C"°N-ATP, 2'°C'°N-UTP; (cycle 4): 2°C'N- ia 
GTP; (cycle 5): ATP, 4GTP, 6UTP; (cycle 6): 3ATP, 3CTP, UTP; (cycle 7): ATP, 2GTP; (cycle 8): 2°°C"*N- 
CTP, 2°°C'°N-UTP; (cycle 9): 3'°C'°N-ATP; (cycle 10): 2CTP, GTP, 3UTP; Termination (cycle 11): 2ATP, 
4CTP, 5UTP 
Lk2-CN Elongation (cycle 2): ATP, GTP; (cycle 3): ATP, 2UTP; (cycle 4): 5GTP, 3UTP; (cycle 5): ATP; (cycle 6): 20 uM, 
*8¢"5N-CTP, “C"®N-GTP, 4°°C'°N-UTP; (cycle 7): '°C"®°N-ATP; (cycle 8): 2'°C'°N-CTP; Termination (cycle 40 mL 
9): 8ATP, 8CTP, 3GTP, 10UTP 


S1+Lk1-CN | nitiation: 96°C=N-ATP, 96°C“N-GTP, 9.6°°C°N-UTP : Elongation/Termination (cycle 1): 14ATP, . ae 
13CTP, 10GTP, 21UTP m 
4nt-CN ; . 13,15 : : , . 130.15 13,15 , 20 uM, 
Elongation (cycle 2): "°c'*N-ATP, GTP; (cycle 3): 2ATP, BGTP, BUTP; (cycle 4): *C"*N-CTP, “c'*N-UTP; | 20 HM 
(cycle 5): 4ATP, 4CTP, 2GTP; (cycle 6): 3ATP, 2CTP, SUTP; (cycle 7): °C"=N-GTP; Termination (cycle 8): 
2ATP, 4CTP, SUTP 
S1+Lk1-H_ | tnitiation: 9SATP, 96GTP, 9.6UTP; Elongation/Termination (cycle 1): 147H-ATP, 13°H-CTP, 10°H-GTP, = ig 
21°H-UTP 
U39-CN Elongation (cycle 2): 3ATP, 7GTP, 8UTP; (cycle 3): 3ATP, 3CTP, °C"°N-UTP; (cycle 4): ATP, 2CTP, ss ae 
2GTP; (cycle 5): 3ATP, 2CTP, SUTP; (cycle 6): ZATP, GTP, 3UTP; Termination (cycle 7): 4CTP, 2UTP m 
RiboA71 Elongation (cycle 2): 3ATP, 7GTP, 8UTP; (cycle 3): 3ATP, 3CTP, UTP; (cycle 4): ATP, 2GTP; (cycle 5): 20 uM, 
(4XNR) 2CTP, 2UTP; (cycle 6): 3ATP; (cycle 7): 2CTP, GTP, 3UTP; Termination (cycle 8): 2ATP, 4CTP, SUTP 30 mL 


*8N-H1-TCV | Initiation: 32°°N-ATP, 240'°N-GTP, 16"°N-CTP; Elongation (cycle 1): ‘°N-CTP, ‘°N-GTP, 3°°N-UTP; 20 uM, 
Elongation (cycle 2): 20ATP, 19CTP, 25GTP, 18UTP 30 mL 


*The initiation and elongation (cycle 1) information are not listed if they are the same as those used in the synthesis of Lp1-CN. 
+The template concentrations and total reaction volumes were listed for NMR and FRET sample synthesis. 
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Extended Data Table 2 | Synthesis efficiencies for the PLOR-generated NMR samples 


Number of Efficiency © Average Efficiency Overall Yield 
Elongation/ 
termination 
Lp1-CN 43.3% 93.1% 24.0% 30.3% 
Lp2-CN raz 49.0% 82.2% 16.2% 10.2% 


bia = ue aul 

Lk2-CN Ea 44.6% 90.2% 14.3% 17.6% 
46.0% 87.8% 

4nt-CN a 47.8% 84.3% 16.2% 12.2% 

U39-CN 44.2% 91.1% 18.5% 23.0% 


*Efficiencies, | and E, for each sample were calculated using the equation Yield = / x E”, where/and Eare the initiation and elongation efficiencies, respectively, and n is the total number of cycles in the elongation 
and termination phases (see previous text on the calculations of synthesis efficiencies). 

+These yields represent the theoretical yields for each sample based on average efficiencies, / and E, and the number of cycles. 

{The yield of °N-H1-TCV is calculated by using / and E from the riboA71 system. 
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Extended Data Table 3 | PLOR recipes for the fluorescent-labelled RiboA71samples 


NTP addition [DNA], 
(NTP coefficients represent the multiples of template) total vol t 


U24Cy3-C55Cy5 Elongation (cycle 2): ATP, GTP; (cycle 3): UTP; (cycle 4): ATP, 5-aminoallyl-UTP; (cycle 5): ATP, 5 uM, 
6GTP, 6UTP; (cycle 6): 3ATP, 3CTP, UTP; (cycle 7): ATP, 2GTP; (cycle 8): 2CTP, 2UTP; (cycle 9, 3 mL 
37°C): 3ATP, Cy5-CTP; (cycle 10): CTP, GTP, 3UTP; Termination (cycle 11): 2ATP, 4CTP, 5UTP 


U24Cy3-C55Cy5-B Elongation (cycle 2): ATP, GTP; (cycle 3): UTP; (cycle 4): ATP, 5-aminoallyl-UTP; (cycle 5): ATP, 5 uM, 
6GTP, 6UTP; (cycle 6): 3ATP, 3CTP, UTP; (cycle 7): ATP, 2GTP; (cycle 8): 2CTP, 2UTP; (cycle 9, 3 mL 


37°C): 3ATP, Cy5-CTP; (cycle 10): CTP, GTP, 3UTP; (cycle 11): 2ATP, 3UTP; (cycle 12): CTP; 
Termination (cycle 13): 2UTP, 3 biotin-11-CTP 


U24Cy3-U65Cy5 Elongation (cycle 2): ATP, GTP; (cycle 3): UTP; (cycle 4): ATP, 5-aminoallyl UTP; (cycle 5): ATP, 5 uM, 
6GTP, 6UTP; (cycle 6): 3ATP, 3CTP, UTP; (cycle 7): ATP, 2CTP, 2GTP; (cycle 8): 3ATP, 2CTP, 3 mL 
5UTP; (cycle 9): ATP, GTP; (cycle 10): 2UTP; (cycle 11, 37°C): ATP, Cy5-UTP; Termination (cycle 
12): 4CTP, 2UTP 


U24Cy3-U65Cy5-B Elongation (cycle 2): ATP, GTP; (cycle 3): UTP; (cycle 4): ATP, 5-aminoallyl UTP; (cycle 5): ATP, 5 uM, 
6GTP, 6UTP; (cycle 6): 3ATP, 3CTP, UTP; (cycle 7): ATP, 2CTP, 2GTP; (cycle 8): 3ATP, 2CTP, 3 mL 
5UTP; (cycle 9): ATP, GTP; (cycle 10): 2UTP; (cycle 11, 37°C): ATP, Cy5-UTP; (cycle 12): CTP; 

Termination (cycle 13): 2UTP, 3 biotin-11-CTP 


U24A555-U65A488-B Elongation (cycle 2): ATP, GTP; (cycle 3): UTP; (cycle 4): ATP, 5-aminoallyl UTP; (cycle 5): ATP, 5 uM, 
6GTP, 6UTP; (cycle 6): 3ATP, 3CTP, UTP; (cycle 7): ATP, 2CTP, 2GTP; (cycle 8): 3ATP, 2CTP, 1 mL 
5UTP; (cycle 9): ATP, GTP; (cycle 10): 2UTP; (cycle 11, 37°C): ATP, Alexa488-UTP; (cycle 12): 
CTP; Termination (cycle 13): 2UTP, 3 biotin-11-CTP 


*Cy3- and Alexa Fluor 555-labelled samples were generated by adding Cy3-NHS ester (GE Healthcare) or Alexa Fluor 555-NHS ester (Invitrogen), respectively, to a solution containing the lyophilized transcript 
product dissolved in double-distilled H20, followed by the addition of 0.5 volumes of 0.3 M sodium bicarbonate (pH 8.3). The reactions were incubated in the dark with gentle mixing for 1 hat 37 °C, 12 hat 25°C, 
and then 1 h at 37 °C. 

+The template concentrations and total reaction volumes were listed for NMR and FRET sample synthesis. 
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Extended Data Table 4 | Crystallographic statistics of data collection and refinement for the 
PLOR-generated riboA71 structure (PDB accession number 4XNR) 


PDB Accession Code 4XNR 


Data collection 


Space group P242;2 
Cell dimensions 
a,b,c (A) 49.5 154.7 25.2 
aBy (°) 90, 90, 90 
Resolution (A) 41.68-2.207 (2.286-2.207)* 
Rmerge (%) 6.1 (103.7) 
Io! 16.9 (1.7) 
CCr2 0.999 (0.578) 
Completeness (%) 100 (99) 
Redundancy 6.6 (6.7) 
Refinement 


Resolution (A) 


No. reflections 


41.68-2.207 (2.286-2.207)* 
10356 (1014) 


Rwork Riree (Yo) 20.4 (36.6) /22.0 (37.1) 
No. atoms 1582 
RNA 1502 
Protein n/a 
lon 6 
Water 64 
Mean B-factors (A) 65.8 
RNA 66.0 
Protein n/a 
Ligand/ion 79.3 
Water 57.8 
R.m.s. deviations 
Bond lengths (A) 0.001 
Bond angles (°) 0.27 


*Highest resolution shell in parenthesis. 
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TECHNOLOGY FEATURE 


ORGANS FROM THE LAB 


The body’s organs are more complex than any factory. Attempts to mirror their 
physiology in the laboratory are getting closer to capturing their intricacies. 


Stem cells can be coaxed into forming organized clusters called organoids, such as this brain model. 


BY VIVIEN MARX 


ratory, researchers have come a long 

way. Engineered tissues are already used 
in medical research and have even entered 
clinical trials. But they are much simpler 
than the real thing. To make a stomach, a 
lab might use 3D printing to create a mould 
that could be seeded with the appropriate 
cells. But without cues provided by blood 
flow and interactions with other tissues, the 
result would be simply a stomach-shaped 
statue, unable to digest or growl. An organ 
is much more than a mass of cells arranged 


I n their quest to create organs in the labo- 


ina particular configuration: it also has sup- 
port scaffolds, blood vessels to deliver nutri- 
ents and signal molecules, and a hierarchy of 
intricate control functions that can respond 
to internal and external cues. 

All this makes it tough to build a func- 
tional, physiologically relevant organ in 
the lab, says Rosemarie Hunziker at the US 
National Institutes of Health, who man- 
ages the funding of programmes devoted 
to designing and building artificial organ 
systems. 

But tissue engineers are making inroads 
into the problem. To try to tackle the bio- 
logical complexity of organs, they can choose 
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from various fabrication approaches. One 
method is to place cells into elaborate, but 
still simplified models of an organ the size 
of a microscope slide, which can then be 
connected together to probe how organs 
interact. These miniature ‘organs-on-chips’ 
provide a unique vantage into organ func- 
tion and disease, and for applications such as 
toxicity tests of drug candidates. An alterna- 
tive approach is to foster the ability of cells 
to self-assemble, in the hope that they will 
recapitulate actual organ development and 
reveal insights into the process. 

Whatever the strategy, researchers can start 
with biologically simple approaches, and >» 
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This unit measures the cells’ 
metabolic activity, such as glucose 
and lactate fluxes, oxygen levels 
and acidification rates. 


4 
» 


HOOKED UP 


> then add complexity to the model a little 
at a time. Just how similar an artificial version 
of an organ needs to be to its original depends 
on the questions that are being asked of it, 
Hunziker says. Artificial organs may look 
very different from their in vivo counterparts 
but nonetheless be useful for drug testing and 
basic research. Whether the goal is to under- 
stand an organ or to replace it, the eventual 
aim is an engineered system that functions 
as reliably as the real thing, Hunziker adds. 
Researchers across the world are using 
these systems to address a wealth of impor- 
tant questions. They can, for example, help to 
reveal how cancer cells detach from a tumour 
to invade other tissues, and allow scientists 
to recapitulate processes in disease and 
development, such as what might go awry in 
neurodevelopmental disorders. 


SYSTEMS THINKING 

The most highly engineered organ models 
are the organs-on-chips that look the least 
like organs in the body. They are made using 
similar manufacturing techniques to those 
for silicon microchips in computers. First, 
a photosensitive material is layered onto 
silicon, and ultraviolet light is used to etch 
grooves in a desired pattern into silicone 
rubber. This guides the production of a 3D 
network of hollow tubes inside a rubbery 
rectangle the size of a computer memory 


am 


fn 


Bioengineers have connected multiple organs-on-chips to replicate human 
physiology. They hope to use the set-up to study the spread of metastatic 
breast cancer to the brain. 


A brain-on-a-chip can be seeded with 
human blood vessel cells, stem-cell- 
derived neurons and neuron-supporting 
cells called astrocytes and pericytes. 


atl | 


y 
f 


stick. The tubes are seeded with cells of the 
desired types and hooked up to pumps and 
an external fluid source, providing inlets and 
outlets through which scientists can mimic 
blood flow and deliver nutrients and envi- 
ronmental signals. Perfusion by continu- 
ously flowing liquid mirrors the dynamic 
environment in organs. The set-up also lets 
bioengineers modulate a tissue’s stiffness as 
well as mechanical, chemical and electrical 
cues to reproduce the signals that cells might 
receive in healthy 


or diseased states, “The aimis an 
says John Wikswo engineered 

of Vanderbilt Uni- system that 
versity in Nash- functions as 
ville, Tennessee reliably as the 
(see ‘Hooked up’). real thing. 


Researchers can 
replicate inflammation, for example, by 
adding the molecular messengers known as 
cytokines and even living immune cells into 
the chips’ channels — they then watch the 
inflammatory response that is characteristic 
of most tissues when damaged or infected’. 
The chips are usually transparent to allow 
high-resolution, real-time imaging of cells, 
says Donald Ingber director of Harvard 
University’s Wyss Institute for Biologically 
Inspired Engineering in Boston, Massachu- 
setts. The liver, kidney, lung, intestine, fat, 
muscle and the blood-brain barrier have all 
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A mammary-gland-on-a- 
chip, which can be seeded 
with healthy or cancerous 
cells. 


been rendered into chip form’. 

Now researchers are combining chips into 
multi-organ systems that can replicate some 
of the body’s physiology. Gordana Vunjak- 
Novakovic and her team at Columbia Uni- 
versity in New York City are building a model 
of the heart-liver-blood system with which 
to probe drug toxicity and disease. Wikswo 
at Vanderbilt, and his colleagues at the Uni- 
versity of Pittsburgh, Pennsylvania, are 
linking organ chips together to predict the 
effects of potentially toxic chemicals and 
drugs. He believes that a liver-kidney model 
could identify safety problems before a drug 
reaches testing in humans, because these are 
the organs in which toxicity first becomes 
apparent. To emulate in vivo situations of 
health or disease, researchers can grow the 
appropriate cells in 3D support structures 
and explore their reactions to cues delivered 
into the system, says Wikswo. 

The key to successful mimicry is attention 
to microstructure, says Hunziker. Careful 
placement of liver cells across a chip can better 
replicate real liver tissue, which has different 
zones close to and away from the main blood 
supply. These zones differ in the genes that 
are active, which results in differences in cell 
development and behaviour, and different 
responses to chemical stresses, she says. 

Microscale organ systems allow experi- 
ments that cannot be done in cell cultures, 
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UAB NORE TISSUE ENGINEERING 


Dan Dongeun Huh at Harvard University’s Wyss Institute tracks cellular events in a lung-on-a-chip. 


animals or people, says Ingber. Organ chips 
lined with cells from individual patients 
enable the assessment of physiological dif- 
ferences between health and disease, and 
between people, in more detail and over a 
longer period than would be practical in peo- 
ple or in animal models, he says. Ingber’s team 
has kept multi-organ chips going for more than 
a month. With an organ chip, it is also possible 
to adjust parameters to see what happens ina 
way that is not possible in a patient. 

Several labs have formed spin-out companies 
to commercialize their model tissues. Emulate, 
in Cambridge, Massachusetts, founded by Ing- 
ber, is developing organ-on-chip systems for 
high-throughput drug screening and toxicity 
testing. The company Hepregen in Medford, 
Massachusetts, co-founded by bioengineer 
Sangeeta Bhatia at the Massachusetts Institute 
of Technology in Cambridge, uses the technique 
of ‘micropatterning’ to develop liver models in 
which different cell types are precisely placed 
to produce a platform that more closely mim- 
ics the complexity of the liver. These are being 
developed as drug-screening assays. Hemo- 
Shear Therapeutics in Charlottesville, Virginia, 
founded by two University of Virginia scientists, 
has developed several organ modelling systems, 
including one that specifically mimics blood 
flow in tissues. In January, HemoShear began 
a collaboration with pharmaceutical company 
Pfizer of New York to find better ways to predict 
injuries to blood vessels, such as inflammation, 
that drug candidates might cause. 

Right now, microfabrication is out of reach 
for many labs. However, there are some com- 
panies that offer services to make chips for labs 
that do not have the necessary equipment or 
expertise. And many universities offer micro- 
fabrication capabilities through core service 
centres. Meanwhile, labs at the cutting edge 
are working to make engineered chips bet- 
ter homes for living cells. One challenge is 
seeding cells evenly throughout the devices 
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and maintaining their growth within the tiny 
channels, says Ingber. Another is that bubbles 
in the system can injure the cells. 


THREE-DIMENSIONAL HELP 

In contrast to organs on chips, soft scaffolds 
seeded with cells can result in artificial organs 
that look much more like the real thing. This 
approach blends a variety of synthetic materi- 
als to make a support system. It is then seeded 
with cells that grow and develop throughout the 
scaffold and thus become arranged in the desired 
configuration. In one well-known example from 
the early days of the field, Linda Griffith at Mas- 
sachusetts Institute of Technology and Charles 
Vacanti at Massachusetts General Hospital in 
Boston and their colleagues used such a scaffold 
implanted under the skin of a mouse to guide 
bovine cartilage-forming cells to grow tissue in 
the form ofa human outer ear™*. The polymers 
in the scaffold degraded as the tissue formed, 
leaving behind the structure made of cartilage. 

Today, Griffith and her team use a custom- 
built 3D printer to create highly intricate tissue 
scaffolds. A stream of photoreactive polymer 
spurts out of the instrument’s nozzle and one 
layer at a time is exposed to ultraviolet light to 
stabilize the structure. Material is removed in 
an iterative process to a build micrometre-scale 
substructure. 

Scientists have also developed ways of mim- 
icking the mechanical stimuli that seem crucial 
to tissue development. For example, the early 
development of teeth ina mammalian embryo 
involves embryonic cells packing closely 
together. To mimic this process, Ingber’s lab has 
developed a polymer that acts like shrink-wrap 
at certain temperatures’. When the polymer is 
warmed to body temperature, it shrinks and 
compacts the cells it encloses, which activates 
genes responsible for tooth development. Bio- 
engineers could potentially use this material 
to induce tissue development for a variety of 
therapies, Ingber says, because cartilage and 
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other internal organs (such as the lungs and 
kidneys) also undergo cellular compaction as 
they develop. 

Incorporating blood flow into a model 
organ is particularly challenging, especially 
when trying to mimic the heart, which pumps 
rhythmically for a lifetime. Nevertheless, tis- 
sue engineers are well under way in their 
search for therapies to help heal injured hearts, 
and eventually perhaps, to find alternatives 
to heart transplants. Starting with a cell-sheet 
technology that does not incorporate a scaf- 
fold, Teruo Okano, a biomedical engineer at 
Tokyo Women’s Medical University and his 
colleagues have made vascularized heart-tissue 
patches. The experiments start with thin layers 
of cells, which they can grow from a variety of 
cell types, including rat neonatal cardiac cells, 
human muscle cells and induced pluripotent 
stem (iPS) cells. These sheets are grown in 
dishes coated with a temperature-sensitive 
polymer. When the temperature is lowered, 
researchers can harvest sheets of cells that 
remain connected to each other without any 
kind of scaffold, says Okano’ colleague Tatsuya 
Shimizu. In ongoing clinical trials, the team is 
evaluating 30 patients with heart problems who 
have received implanted tissue patches made 
from muscle-cell-derived sheets. These sheets 
secrete several types of cytokine, which pro- 
mote blood-vessel formation and inhibit cell 
death in the patient's heart tissue. In the future, 
Shimizu and his colleagues hope to transplant 
tissues with beating cells. 

But these sheets are not yet optimal. The ideal 
grafts need to be thick, especially because events 
such as heart attacks lead to thin heart tissue. 
The team has returned to the lab to engineer 
thicker patches that will be infiltrated with even 
more blood vessels 
and should remain 


it = 

Fi ngineered viable for longer than 
=—_ poke the previous versions. 
starting to F They have grown cell 
allow imcistve sheets from human 
experiments iPS cells and trans- 
and even planted them into rats 
replacement just under the skin on 


therapies. » their backs, building 
up a patch 1 millime- 
tre thick made of 30 cell sheets’. After implanta- 
tion, small blood vessels from the rat sprouted 
through the layers. By moving smaller stacks 
into more vascularized areas, the researchers 
were able to cause more and more blood ves- 
sels to grow and eventually to connect the stack 
directly to larger blood vessels, such as the jug- 
ular vein. The heart-muscle cells continued to 
beat during six months of observation. 

However, similar multiple surgical interven- 
tions could not be carried out in people. So the 
researchers have developed a technique that 
relies on a gel on which they can grow multiple 
layers of rat-cell sheets in the lab. 

One day, Okano and his team hope, it will 
be possible to engineer such grafts for use in 
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humans with severe heart failure. The general 
approach could also be applied to engineer tis- 
sue to mimic the liver or kidneys. 


SELF-ASSEMBLY 

Other teams rely even more heavily on the 
intrinsic ability of cells to assemble into com- 
plex structures. Stem cells grown in suspension 
can be coaxed to form organized clusters called 
organoids, and these have been made for diverse 
tissues, including intestine, kidney and retina. 
Organoids are usually much smaller than the 
actual organ, just a few millimetres across, and 
with a much simpler assortment of cells, but 
some teams are now making organoids with 
more cell types and more complex structures, 
and even attempting to model the most daunt- 
ing organ — the brain. 

In 2013, Madeline Lancaster and Juergen 
Knoblich at the Institute of Molecular Biotech- 
nology of the Austrian Academy of Sciences in 
Vienna generated human brain-tissue orga- 
noids about the size of a lentil’. They started 
from groups of human pluripotent stem cells, 
which are differentiated into neural tissue. Part 
of the protocol is to let the biology unfold. 

Under the right conditions, the differenti- 
ating cells self-organize into a tight swirl of 
neural tissue with multiple cell types, includ- 
ing radial glial stem cells that give rise to cells 
in the brain such as neurons. The swirls even 
include rudimentary brain structures such 
as the beginning of a forebrain and retina. 
“We pretty much recapitulate the formation 
of neural tissue in a dish, letting it develop as 
it does in the embryo,” says Knoblich. 

These cerebral organoids have helped them to 
address questions that are hard to answer when 
growing neurons flat on the surface of a culture 
dish. The team studies the human neurode- 
velopmental disorder microcephaly, in which 
infants have markedly small brains. Although 
mice can be used to model the disorder, the 
animals do not show the extreme difference in 
brain size. But when the team reprogrammed 
skin cells from a patient with microcephaly into 
iPS cells that developed into cerebral organoids, 
the resulting structures bore clear characteris- 
tics of the disease. In these organoids, the radial 
glial cells proliferated less and, in some regions, 
differentiated into neurons prematurely. Even 
under normal conditions, radial glial cells do 
not proliferate in developing mice the way they 
do in humans, and so human organoids are a 
promising way to study how these neural pre- 
cursor cells might be involved in the disorder. 

Lancaster and Knoblich also used organoids 
to assess the effects ofa gene called CDK5RAP2 
that helps to guide cell division. The patient with 
microcephaly had a mutation in this gene that 
probably results in an aberrant protein. When 
the team introduced an undamaged protein 
into the organoid, some cells developed into 
types akin to radial glial cells, indicating that 
the loss of function of this gene contributes to 
microcephaly’. 


ORGAN BUILDING 


Instead of replacing damaged or diseased 
organs, some labs have attempted to stimulate 
organ regeneration. Researchers have tapped 
into the innate chemical and bioelectric signalling 
of flatworms to induce the development of a 
second head. The regeneration occurs because 
the altered pattern is stored across the animal’s 
bioelectric network. 


1] An adult normal flatworm. 


2 | Manipulation of the worm’s bioelectric network. 


The worm is permanently altered by rewriting 
the memory of what should be regenerated. 


There are still plenty of challenges for orga- 
noid technology. Lancaster and Knoblich point 
out that their organoids lack a blood supply and 
the interaction that neural tissue normally has 
with surrounding tissue. Over time, the orga- 
noids begin to die and lose resemblance to early 
brain tissue. The team has managed to keep 
them alive for as long as a year, but how useful 
late-stage organoids are for disease modelling 
remains to be seen, says Knoblich. 

Another challenge is consistency, because 
the organoids take on different shapes from 
one batch to the next, he says. The lab is 
continuing to tinker with the growth condi- 
tions in the hope of overcoming these prob- 
lems and being able to model more complex 
neurodevelopmental disorders. 


LIKE AN EMBRYO 

Bioengineers intent on building organ mod- 
els can tap into the complexity of intercellular 
signals — the wealth of biochemical and bio- 
electrical messages that tell cells to differenti- 
ate, migrate, change shape or clump together 
to form an organ. This approach has already 
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been used to regenerate legs on frogs that are 
normally too old to naturally regrow an ampu- 
tated limb. That work, led by Michael Levin, a 
biomedical engineer at Tufts University in Med- 
ford, Massachusetts, might translate more read- 
ily to humans than many expect: children can 
regenerate fingertips, but adults cannot. 

To get the frogs’ legs to regrow, Levin and 
his team chemically tinkered with the pat- 
tern of electrical charge in the limb so that it 
matched the bioelectrical gradient found in 
the limbs of young animals*, and this induced 
the cells at the tip of the amputated limb to 
grow. They also induced the formation of a 
two-headed flatworm (see ‘Organ building’). 
The alterations the team had made caused a 
permanent change in the memory of what 
to form, which was encoded in an electrical 
circuit just like memories in our brains. Levin 
describes the process as “manipulating infor- 
mation” within the tissues in ways that cause 
predictable, large-scale changes in growth 
and form. That, he says, “makes the job of 
growing anything much easier”. 

Instead of trying to micromanage organ 
building, Levin believes in leveraging the 
body’s own processes. He and his team are 
developing a physiological ‘phrase book of 
mathematical models and software. Scien- 
tists can use these software tools to search for 
factors to manipulate in their experiments, 
and so find ways to tell cells what tissues 
to build. The goal is to link data sets about 
genes, proteins and signalling pathways to 
knowledge about how organ shape and func- 
tion is regulated. “These are the kinds of tools 
that will be indispensable as bioengineers 
confront the complexity barrier facing the 
creation of even simple organs,” says Levin. 

Ultimately, the usefulness of the tool is what 
is important, not the specific approach that is 
chosen. Engineered tissues are starting to allow 
incisive experiments and even replacement 
therapies. And perfectly mirroring nature may 
not, in all cases, be what is needed. “What is 
critical is that the organ has enough complex- 
ity to accomplish its function,” says Hunziker. 

Whether it be a patch for damaged hearts, 
a better toxicity test or an insight into a dev- 
astating brain disease, tissue engineering 
delivers what scientists crave: more under- 
standing, and the potential to help people. m 


Vivien Marx is technology editor for Nature 
and Nature Methods 
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COLUMN 
Reset your brain 


A bit of self-reflection can propel scientists into a career 
beyond the lab, says Christopher Taylor. 


cancer researcher Clayton Boldt was 

having a bad day. Several weeks’ worth 
of experiments had yielded little in the way of 
results, his cell cultures were not growing, he 
had mixed up his samples on a western blot, 
and he had no spares. As on other tough days, 
he began an internal pep talk: he was preparing 
for his future, a research career that would ful- 
fil his love of science. 
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But as Boldt reviewed what was good about 
his work, he came to an astonishing realization: 
he was uncertain whether he actually wanted 
to pursue academic research. For the first 
time, he asked himself question that occurs 
to almost every biomedical trainee who is 
considering the next stage of her or his career: 
‘Where do I go from here?’ 

In his quest to solve his dilemma, he reached 
out to me for guidance. As the associate director 
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of the Center for Professional Development & 
Entrepreneurship at the MD Anderson Cancer 
Center in Houston, Texas, I have had the oppor- 
tunity to counsel hundreds of biomedical 
trainees who, like Boldt, find themselves ques- 
tioning their commitment to academia. AsI do 
for all in his situation, I advised Boldt to create 
what I call a career-exploration framework. This 
process involves four central steps: understand 
yourself, learn your options, identify any gaps 
in your knowledge or experience and, finally, 
launch your plan. Although these steps might 
look simplistic, each part is important and 
builds on the previous one. 

Boldt, for instance, already knew that he 
enjoyed writing and talking with non-scientists 
about science. In 2011, he had launched a 
blog to pose scientific questions and topics 
to the online community and had begun to 
develop a reader base that participated in these 
discussions. This came up as we talked about 
his career-path challenges and where he saw 
himself after his training. So, together, we cre- 
ated a framework that helped him to clarify 
his strengths and improve his candidacy for a 
communications position that would dovetail 
with his interest in and passion for science. 

Today, Boldt works as a senior communica- 
tions specialist for MD Anderson. He writes 
opinion articles, press releases, pitch letters and 
blogposts in support of the institute's division 
of cancer prevention and population sciences. 
“The path out of academic research is not par- 
ticularly well paved. It is certainly not something 
we are trained for, he says. “However, when 
you've found something that excites you, don't 
be afraid to let it guide you into the unknown?” 

Boldt is a good example of someone who 
successfully crossed the bridge from lab 
research into the non-academic world. Career 
doubts can paralyse any of the thousands of 
people who are pursuing PhDs and postdoc 
training today. By following my four-part 
plan, scientists can create a career-exploration 
framework that can help them, too, to realize 
a non-academic career track. 


KNOW YOURSELF 

The first step can be difficult — creating a 
personal framework requires introspection, 
reflection and taking a frank inventory of your 
life. Biomedical trainees can view this step as 
a way to identify their ‘professional DNA: 
personality traits, core values, education, 
technical training, work experience, soft skills, 
accomplishments and overall life experience 
(see ‘Map your professional DNA). > 
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> — Researchers can use different methods 
to sketch out each of these areas. For example, 
taking the Myers-Briggs Type Indicator 
personality test (see go.nature.com/rh7koz) can 
help scientists to determine their preferences, 
strengths, weaknesses and social compatibility. 
Another option is to create a checklist that iden- 
tifies skills that are transferable across different 
work settings — communication, negotiation 
and project-management abilities, for instance. 
Scientists should also make a detailed record of 
obligations and functions for which they are 
responsible in the lab. 

It is important to articulate these attributes 
in written form because they help to sculpt the 
professional DNA map that will become part of 
the trainee’s framework. Without this holistic 
view, it will be hard for scientists to identify 
gaps in their skills or experience. This informa- 
tion illuminates the trainee’s current candidacy 
potential while also revealing any preferences 
for specific job functions, workplace settings 
and other specifics that they might wish to 
incorporate into a future position. 

Ihave developed training exercises that aim 
to guide people towards such self-understand- 
ing. One of these, known as ‘20 words; helps 
scientists to start to outline their professional 
and personal identities and flesh out their 
DNA maps. They ask 8-12 people who are 
close to them, such as friends, family members, 
colleagues and supervisors, to choose 20 words 
to describe them. Such lists highlight positive 
and negative perceptions that might need to be 
emphasized or mitigated to potential employers 
during a job search. Common descriptors also 
measure how consistent the young scientist is 
across professional and personal settings. From 
here, scientists have a baseline from which to 
explore their career options. 


UNDERSTAND YOUR OPTIONS 

Opportunities for early-career biomedical 
researchers to land tenure-track academic 
positions are shrinking, and US federal funding 
remains highly competitive. This supply- 
demand imbalance is well documented (see 
go.nature.com/aq8qlu). In 2011, for instance, 
the US National Institutes of Health (NIH) 
established a workforce committee to investi- 
gate the sustainability of biomedical research. 
Last February, committee co-chair Sally Rockey, 
deputy director for extramural research at the 
NIH, gave a lecture at MD Anderson on the 
situation facing PhD graduates. 

Rockey explained that in 2012, some 
145,000 biomedical PhD graduates were in 
full-time positions in the United States, and 
nearly as many postdocs and graduates were 
in the pipeline. Although unemployment was 
at a low for biomedical PhDs, fewer than half of 
those employed full-time were doing academic 
research or teaching, and onlya small portion 
had tenure-track positions. 

But trainees need not despair: the biomedical 
enterprise is not limited to academic research 
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MAP YOUR PROFESSIONAL DNA 


Scientists can develop a career-exploration framework to help them to identify 
opportunities for non-academic paths that align with their professional DNA. An example 
rubric might include desired job functions, workplace-environment characteristics, 


professional interests or specific business sectors. 


Professional DNA 
maps a person's 
background, traits 
and talents, and will 


sculpt each area of 
the framework. 


* Skills 
* Abilities 

* Education 

* Knowledge 

* Personality type 
* Values 

* Work style 


alone. Scientists can take advantage of their 
newly mapped professional DNA to find 
non-academic career choices that suit them. 
Although they typically spend years honing 
their scientific-research skills in a narrow dis- 
cipline, many of those abilities — such as data 
analysis and conflict-resolution and presenta- 
tion skills — are applicable in a broad range of 
industries and positions outside academia. 
Boldt learnt the benefits of transferable skills as 
he prepared to embark on his new career path. 
“One of my biggest concerns was whether I had 
wasted the half decade it took to earn my PhD? 
he says. “I’ve realized, though, that my training 
developed skills I wasn’t even aware of — perse- 
veranice, project management and communica- 
tion” He notes that, beyond identifying those 
abilities, the other challenge for early-career 
scientists is to find a position in which they can 
continue to satisfy their love of science. 

Career choices are numerous across the 
biomedical and bioscience enterprise, which 
consists of at least ten sectors: biopharmaceu- 
ticals, devices and diagnostics, consulting, 
law, finance and investments, government, 
non-profit organizations, education, health- 
care, and media and journalism. As scientists 
narrow their preferences to specific disciplines, 
they will become better equipped to advance 
to step three of the framework: addressing any 
holes in their background. 


IDENTIFY GAPS 

For this step, biomedical scientists should first 
research regional, national and international- 
trade and membership groups and associations, 
leading companies and the positions that exist 
in each of their desired sectors. Perusing these 
networks will allow trainees to gain an under- 
standing of the experiences and skills that will 
make them competitive for a spot, and thus will 
flag any potential gaps in those areas that they 
might need to tackle. 
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When reading employer expectations, it is 
important not to focus on job titles, but rather 
on functions, responsibilities, work environ- 
ment and aspects beyond degrees or technical 
skills. Different companies might use variations 
of titles for a role that carries the same respon- 
sibilities, so an online search for ‘research 
scientist’ could exclude positions that are 
equivalent in function, but different in name. 
Trainees seeking employment will discover that 
many disciplines are recruiting for similar posi- 
tions; for example, the biopharma, device and 
diagnostics and health-care industries may all 
be looking for a quality-assurance specialist. 

Trainees can further pinpoint gaps by 
examining job postings and by networking — 
setting up coffee chats, lunches and informa- 
tional interviews with leaders and executives 
in a discipline of interest. It is also important 
to attend relevant meetings in that sector, 
including conferences and trade-and-industry 
organization conventions. The trainee’s goal 
is to talk to employers and managers in the 
business to develop a clear picture of both the 
minimum and preferred qualifications — and to 
assess how her or his own skills, knowledge and 
experience measure up to these requirements. 

Armed with this information, scientists can 
fill in gaps by taking advantage of internships, 
classroom or online courses at their institu- 
tion, and Internet communities — LinkedIn, 
for example — that bring together members 
of a specific industry or discipline. Once they 
have solidified their competitiveness in these 
ways, they can start to write their applications. 


LAUNCH THE PLAN 

Although a PhD is essential for a scientific 
discipline, it is also important to recognize that 
one’s degree — or postdoc experience — does 
not necessarily train one for the work world 
outside academia. Before launching into the 
career hunt, graduate students and postdocs 
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should keep timing in mind: those who are 
more than a year away from the end of their 
programme should continue to improve 
their marketability in the industry in which 
they will seek ajob. This could include learn- 
ing fundamental business principles or 
earning a certification of value in that sector, 
such as a Project Management Professional 
certification in health care. 

Those who hope to enter the job market 
in the next year, however, should focus on 
their job search, including such factors as 
CV development, professional branding, 
interview preparation, networking and the 
job-application process itself. 

The CV should spring from the results 
of the professional DNA map, and needs to 
highlight transferable skills and specific areas 
of expertise (see go.nature.com/vlnhnm). 
Professional branding calls for, at least, a 
robust LinkedIn profile that mirrors the CV 
and lists research projects, papers, presenta- 
tions and peer recommendations. Tweeting 
publications and talks can also be helpful. 
To prepare for an interview, trainees should 
video themselves answering sample inter- 
view questions — examples are available 
at online career forums and publications 
(see go.nature.com/oiaoik) — and review 
the footage to understand how a potential 
interviewer might perceive them. 

Trainees should also list 10-15 profession- 
als in their desired 


industry to contact “When you ve 
for informational found something 
interviews, and that excites you, 
use LinkedIn, don’tbeafraid 
company websites to let it guide 

and other online youinto the 
resources. It is unknown.” 


important to gain 

insight into how these professionals work 
and to prepare for interactions with them. 
Finally, applicants must remember to cus- 
tomize their applications and CVs to address 
each job advertisement, rather than using a 
generalized portfolio for all. 

Change is difficult, especially after doing 
the same thing in the same environment 
for most of one’s adult life. But making the 
decision to embark on a new direction is 
often the hardest part. Sometimes taking 
that different path means believing it is OK 
to leave academia — even though academic 
culture may not support that belief: Having 
the proper tools can reassure biomedical 
trainees who worry about academia’s locked 
gates — and help them to prepare for and 
pursue other careers that will offer an abun- 
dance of satisfaction and success. m 


Christopher Taylor is the associate 
director of the Center for Professional 
Development & Entrepreneurship at the 
University of Texas MD Anderson Cancer 
Center in Houston, Texas. 


TURNING POINT 
Paul Tesar 


Paul Tesar discovered a new type of mouse 
stem cell in 2007 and published his discovery in 
Nature as a graduate student at the University 
of Oxford, UK. The paper launched him rapidly 
to a professorship at Case Western Reserve 
University (CWRU) in Cleveland, Ohio, where 
he continues to drive the field forward. 


How did your graduate experience shape your 
career pursuits? 

I was part of a joint programme — the US 
National Institutes of Health (NIH) Oxford- 
Cambridge Scholars Program — and split 
my time working with NIH human-stem-cell 
biologist Ron McKay and Oxford mouse 
embryologist Richard Gardner. I had a great 
deal of independence from day one. My first 
publication was a solo-authored paper in the 
Proceedings of the National Academies of Science 
(P. Tesar Proc. Natl Acad. Sci. USA 102, 8239- 
8244; 2005) — a feat almost unheard of today. 


How crucial was the fellowship to your stem- 
cell discovery? 

I didn’t set out to discover another pluripotent 
state. But my ‘eureka’ moment came when I 
derived mouse embryonic stem cells with 
similar characteristics to human ones, includ- 
ing the ability to differentiate into all other 
tissue types, known as pluripotency (P. Tesar 
et al. Nature 448, 196-199; 2007). It took a 
while to prove that there was more than one 
pluripotent state, but that paper triggered a 
frame shift in the field. Epiblast stem cells would 
have been discovered anyway, but drawing 
the NIH and Oxford research together put 
that revelation in our hands. So strongly do 
I feel that dual mentoring arrangements are 
fruitful, that I encourage my students to find 
mentors with different skill sets. 


What happened after the epiblast publication? 
I was looking for my next position and wanted 
to maintain my level of independence. I also 
wanted to go back to Cleveland someday, and 
the CWRU — where I earned my bachelor’s 
in biology — encouraged me to return. After I 
explained my goals, they created a one-off posi- 
tion that gave me a few years of funding and my 
own independent lab straight away. Things went 
better than I could have hoped. I hired good 
people and published some strong papers. The 
next year, at 28 years old, I applied for a faculty 
position. It was a whirlwind experience. 


How did you maintain momentum? 
In 2010, I was named a New York Stem Cell 
Foundation Robertson Investigator, which gave 
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me US$1.5 million in innovation funding. That 
was enough to grow my lab from 3 to 12 mem- 
bers and to drive our research into unexplored 
areas. It was such a new field that I was able to 
rapidly address some major questions. 


Is stem-cell biology entering a ‘golden age’? 
People have expected much from this promising 
field; it is time to deliver. Stem-cell transplanta- 
tions are feasible and will continue to advance 
—and we are learning howto control stem-cell 
populations to perform specific functions. For 
example, recently, we screened for drugs that 
can stimulate the generation ofa brain-cell type 
that is typically lost in diseases such as multiple 
sclerosis. There has been a huge uptick in the 
number of stem-cell-based trials, and the results 
of those will guide the future. 


Do you do much work at the bench? 

I would love to, but I spend most of my time 
writing grant proposals and renewals, progress 
reports and paper reviews. Unfortunately, I 
am really there only during the holidays after 
I have sent everyone home. They trust me 
enough to grow cells. 


What is the best career advice that you have 
given or received? 

They are one and the same: maintain work-life 
balance. The 24-7 culture of always answer- 
ing e-mail and working through the night is 
counterproductive. I tell my students that work- 
ing 12-15 hours a day does not equal a Nature 
paper; being smart about which experiments 
you pick will get you high-profile publications. 
When people are happy and stress-free, they can 
be creative and explore new areas. m 


INTERVIEW BY VIRGINIA GEWIN 


This interview has been edited for length and clarity. 
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BY TIM CASSFORD 


not like they’re silent one second and 

there the next. There’s a gradual shift 
between the two states, I suppose. Some- 
times it’s only when you stop to think clearly 
that you notice they're there. Or that they’ve 
gone, and the world is silent again. 

The pills help. I know they help and so I 
take them, but then everything's ok, so why 
am I taking any pills? Am I sick? It goes like 
this, round and round and round. Some- 
times I wish it would end, that things would 
just be ok, and that would be it, finished, like 
a good book. But life isi’t like a book, never 
mind a good one. It just goes on and on and 
on, slowly getting quieter and quieter, until 
it’s only when someone stops to think clearly 
that we can see that it’s gone. 

So this is what I do, as often as I can 
remember: after I come into the house, and 
the doors shush behind me, I go and sit on 
the edge of the sofa bed, and I close my eyes, 
and I listen. Really truly listen. And I can 
hear the gentle thub thub of blood as it pulses 
through me. And I can hear the tiny creak in 
my neck as I move my head from side to side. 
Ican hear the distinct hums of the refrigera- 
tor, the air conditioning, the air filters and 
the computer screen. The susurrus of my life. 
But nothing else. 

The voices have stopped. 

I think of the tablets, and I bite down on 
my thumbnail. It was healing well. I go over 
to the computer and I start playing some 
games. Messages flash up at the bottom of 
the screen, my online friends. Sometimes I 
reply to them, but not often. Everyone knows 


FE hard to say when the voices start. It’s 
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ALL, ALONE 


Comfort comes. 


they’re not real people, only ghosts of acom- 
puter program to try to convince you that 
you exist, that people care for you, that your 
life has worth beyond your ability to con- 
tribute to the economy. Everyone knows that 
they’re not real, but who is everyone? I try 
to think of the last time I saw someone in 
the flesh, but the memory peters out, elusive 
as love. 

The clock on the display wall catches my 
eye as its numbers silently shift upwards. 
I used to hate the clock, resent its silent 
plodding. You can’t get rid of the clock, 
only change its size and style. For a while I 
shrunk it down as small as it would go, but 
then I kept thinking it was a fly or a bug on 
the wall, and that was even more distracting. 
The last thing I need is to be seeing flies that 
arent there. So now it does its impotent best 
to attract my attention by being inconspicu- 
ous. It’s half past two. I have no idea which. 

I try to do some work. I check my mail to 
see if my manager has noticed any change in 
my output, but no. There are several polite 
reminders noticing that I haven't taken my 
pills in, I count back the dates, five days now. 
The first few messages look like generic cir- 
culars, the tone of the last two is more press- 
ing, more urgent. Am I sick? Do people get 
sick so quick? 

I get up and stretch and yawn. The clock 
moves its hands much more gracefully. I 
go over to the refrigerator and take out my 
cold box of medication. I scan the barcode 


through and the pack 
> NATURE.COM pops open. I roll the 
Follow Futures: pill out into my hand 
Y @NatureFutures and it sticks to it 


E} go.nature.com/mtoodm = awkwardly. I hadn't 
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realized I was sweating. I swallow the pill 
down and reach for the glass of water already 
waiting for me. There’s no bitterness. 

Someone once told me that drug com- 
panies would do their best to name their 
new medications with as many Xs, Ys and 
Zs in as possible. People would think they 
worked better if they couldn't pronounce 
the names. Or maybe I read it somewhere. 
Whata world. 

These pills work, everyone knows it. It’s 
only a problem when you forget why you 
need them, when youre not sick; when you 
think you're well. 

It’s hard to say when the voices start. 
There’s a shift, a gradual shift, and little by 
little you notice how much more comfort- 
able everything is, like the whole world is just 
that much better. Like drinking hot choco- 
late on a crisp, cold day, or relaxing into a 
hot bath, or drinking a glass of fine wine, or 
reaching the end ofa good book. 

I try to remember a time before the pills 
came along, before they were necessary, 
before we all needed to hear the voices, but I 
can't. Maybe we spoke to each other; maybe 
we were each other’s voices, each propping 
the other up. But who would do that? 

When the voices come, they are a gentle 
whisper in my ear, a soft rustling of leaves, 
a faint birdsong. Gradually they take form, 
and I'm happy to wait. They’re speaking to 
me. They’re for me. I listen. Tears of happi- 
ness form at the corners of my eyes. 

“You are loved.” m 


Tim Cassford is a doctor and lives and 
works in West Sussex. None of the problems 
with the NHS are his doing. Probably. 
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he Nature Index is a unique 

database that tracks affiliations in 

research publications in a select 
group of scientific journals. The Index 
can provide an indicator of high-quality 
research contributions from institutions, 
countries, regions and disciplines. In this 
Global Nature Index supplement, we 
present a snapshot of the Index for the 
calendar year 2014. 

We have grouped countries into nine 
regions. The strongest performances 
come, not surprisingly, from North 
America, North & West Europe, and 
East & Southeast Asia. In fact, these three 
regions accounted for 91% of the Index’s 
weighted fractional count (WFC), a 
metric that apportions credit for each 
article according to the affiliations of the 
contributing authors. 

Papers from across the natural science 
disciplines contribute to the Nature 
Index: chemistry, life sciences and 
physical sciences each represent about 
one-third of the total, with earth and 
environmental sciences accounting for 
less than 10% (due to the interdisciplinary 
character of research, some articles count 
in more than one category; as a result, 
the total adds up to more than 100%). As 
the Global Overview (page S2) shows, 
there are some striking differences in 
strengths between regions. For example, 
North American institutions (page S4) 
as a group are strong contributors in the 
life sciences. By contrast, almost 90% of 
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p22, 


the contribution from Central & South 
Asia comes in chemistry or the physical 
sciences, 

The Index also provides insights into 
how scientists team up with colleagues 
from other regions. Contributions of 
researchers from regions that lack a 
strong scientific infrastructure often 
result from collaboration with better- 
resourced colleagues in better-resourced 
parts of the world. Indeed, the highest 
level of collaboration was found in 
Africa (page S28), where scientists’ 
contributions to Nature Index papers 
were almost three times more likely to 
occur in papers with researchers from 
other regions than were contributions 
from North American scientists. 

Funding is a big driver of publishing 
performance. For example, Israel's 
GDP ranks 37th globally, but it placed 
17th for WFC in the 2014 Index. 
Much of that imbalance comes from 
investing in science and technology: 
Israel put 3.9% of its GDP into science 
and technology, a higher percentage 
than any other country. 

The Nature Index 2015 Global helps 
us start to more deeply explore the 
relationships that have an impact on 
high-quality research output. 
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GLOBAL OVERVIEW 


Three regions — North America, North & West Europe, and East & Southeast Asia — 
produced 91% of the overall 2014 output (WFC) in the Nature Index. 


TO EACH THEIR OWN , : a 


Relative to its overall WFC, each region shows 
a unique subject distribution in the Nature 
ndex. North America is extremely strong in 


4 
ee a L 
he life sciences, whereas contributions from 3 
Central & East Europe and West Asia are North America 
: : 


predominantly in the physical sciences. WFC: 19,425 
East & Southeast Asia are strong in 
chemistry. North & West Europe and 


Germany 
came third 


4 fe : ‘ globally for 
Australasia & Pacific Islands split their output in the 
contributions relatively evenly amongst Index with a 

hese broad subject areas. 2014 WFC 
of 4,019. 


he United States 


. * produced 38% of the total 
Research output by subject WFC in 2014, although it 
The graphics on the map show the only places 10th in the 
rtion each subject area world for research and 
Propo! > . . development (R&D) 
contributes to a region’s overall WFC*. unding as a percentage 
of gross domestic product 
(GDP). 
ee 60% He 
‘ 50% Dy, 
Se. 
& “740% Z j 
- (30% The total number of Tine 
+20% 2014 articles in the : 
bio% Nature Index. Caribbean Islands 


Although Brazil came 23rd : 

in the world for WFC, it led WFC: 574 
that category for the Latin 
America & Caribbean 
Islands region, and it also 
invested the most in R&D 
as a percentage of GDP. 


"Subject areas can overlap, so the total percentage 
may exceed 100%. 


REGIONAL COLLABORATION PATTERNS ENVIRONMENTAL COLLABORATIONS RULE 


Most regions collaborate extensively with each other. African researchers are particularly active in forming International collaboration rates differ not only across regions, 
connections with their counterparts in other regions, collaborating on over 70% of all their output in the but also across disciplines. In particular, researchers in the 
Nature Index. The three heavy-weights — North & West Europe, East & Southeast Asia and North America earth and environmental sciences reach out internationally.* 


— show the most regionally independent signature in their approach to collaboration.* 
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Latin America & Caribbean Islands 
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“Bars represent the proportion of a region's overall output in the Index (AC) *Graphs represent the proportion of a region's overall output in the Index (AC) in each 
stemming from collaborative efforts with other regions. subject area stemming from international collaborative efforts. 
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Acronyms 


AC: article count 
(number of papers) 


Regions Bi Central & East Europe 

WB North America West Asia 

IB Latin America & Caribbean Islands [) Central & South Asia 

MB North & West Europe East & Southeast Asia 

BB Africa i Australasia & Pacific Islands 


FC: fractional count 

(apportions article count for each contributor affiliation) 
WFC: weighted fractional count 

(applies a weighting to FC to correct imbalance 

in number of astrophysics papers) 


North & West Europe L o ; 
WFC: 15,785 F 
Em fa Central & 
; East Europe 
e4 


WFC: 1,091 


/ 


Central & South Asia 
WFC: 944 


China’s WFC grew 
by 16% from 
2013 to 2014. 


East & Southeast Asia 
WFC: 11,464 


India’s WFC 
increased by 
8.3% from 
2013 to 2014. 


Australasia & 


Pacific Islands 
WFC: 1,048 


Australia finished 12th 
in the world based on 
its WFC in 2014. 


Of Africa’s top 10 institutions by 
2014 WFC, 9 were in South Africa. y 


SPREADING THE SUBJECT SPECIALTIES 


Each subject area in the Index shows a unique regional contribution pattern according 
to WFC. Three regions — North America, North & West Europe, and East & Southeast 
Asia — account for most of the WFCs across all disciplines. 


Enlarged 300% 


— 


EARTH AND 


ENVIRONMENTAL 
SCIENCES 


Eight of the world’s top 10 
institutional contributors to 
the Nature Index in the life 
sciences institutions in 2014 
were in North America. 


Top institutional chemistry contributor 
the Chinese Academy of Sciences 
accounted for three times the WFC 
compared to second-place French 


National Centre for Scientific Research. 


GOING VIRAL 


In general, a region’s WFC correlates with its total Altmetric score. 
The exception is in East & Southeast Asia, where researchers might 
not publicize their work as well as scientists from other regions. 
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North America 


This region includes lots of strong institutions — from 
government agencies to universities — but curtailed 
funding raises concerns for the future. 


ARTICLE COUNT (AC): 28,378 
FRACTIONAL COUNT (FC): 21,430 
WEIGHTED FRACTIONAL COUNT (WFC): 19,425 


orth America remains the world’s most 
| \ | dominant region for high-quality sci- 
entific output in the Nature Index, 
with a weighted fractional count (WFC) of 
nearly 19,500. The gap is getting narrower, 
however, between North America and East 
& Southeast Asia thanks to China, which is a 
distant second to the United States but with 
a rapidly increasing number of publications. 
Although the two countries that make up the 
North America region — the United States and 
Canada — enjoy substantial federal funding for 
scientific research, that source has been con- 
strained in recent years and is not expected to 
grow significantly in the foreseeable future. 
Almost half the region’s output in the Index 
is in the life sciences, representing a greater 
share of total science output than in the rest 
of the world. In the United States, much life 
science research is financed by the National 
Institutes of Health (NIH), a government- 
funded agency. The NIH is the country’s 
largest non-defence funder of research and 
the largest biomedical research agency in 
the world. The agency, however, has seen a 
reduction in purchasing power by a fifth over 
the last dozen years as budget increases have 
failed to keep pace with inflation. Canada, 
meanwhile, continues to place emphasis on 
life sciences through the Canadian Institute 
of Health Research. The government's most 
recent strategy for science and technology 


innovation put life sciences and health as one 
of five priority areas, with particular focuses 
on neuroscience, ageing, regenerative medi- 
cine and biomedical engineering. 

North America is one of the most self-suf- 
ficient regions when it comes to published 
papers, and its rates of collaboration are lower 
than the global aggregate in every subject area. 
Ofall the North American articles that include 
researchers from more than one country, 11% 
are within the region, stemming from cooper- 
ation between the United States and Canada. 


“WE'RE LEAVING GREAT 

STUFF ON THE CUTTING 

ROOM FLOOR, BECAUSE 
WE HAVE TO PRIORITIZE.” 


The United States is the collaboration partner 
of choice for most of the world’s countries. 
Both China and South Korea partner with US 
researchers for half of all their international col- 
laborations. The United States is also the num- 
ber one collaborator for Canada, as well as for 
several countries from every region. 

Of the total WFC for North America, 4.5% 
came from articles published in either Nature 
or Science, a higher percentage than any other 
region except for Africa — but that continent's 


NORTH AMERICA ANALYSIS 


Countries’ weighted fractional count (WFC) 
Canada accounted for less than 8% of the region's WFC, 
leaving a lopsided advantage to the United States. 


1,489 


TOTAL WEE: 


19,425 
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4.7% arises from a WFC in those journals of 
only 6, while for North America the total is 870. 


INTERNATIONAL CHALLENGER 

The United States’ WFC in the Nature Index is 
17,936, a drop of 3.5% from the previous year. 
Although with only two years of data it’s dif- 
ficult to assess the significance of that change, 
its position as the lead country for high-quality 
research could soon be in jeopardy. The coun- 
try with the second largest output in the Index 
is China, whose WFC of 6,037 is roughly a third 
of the US count. That number, however, is a 
16% climb for China from 2013. 

US National Science Foundation (NSF) sen- 
ior analyst Carol Robbins says China is not the 
only contender experiencing meteoric growth. 
“The article output of developing countries has 
been growing faster than the European Union 
and the US since at least 2000,” Robbins says. 
But she cautions, “Growth in the developing 
world is going to look more rapid than growth 
in the developed world, because they're starting 
from a lower base.” 

The United States is most productive in life 
sciences, which represents 44% of its 2014 
output. But it is in chemistry and physical sci- 
ences, each accounting for nearly 30% of out- 
put, where the scientific giant enjoyed most 
international acclaim last year. The 2014 Nobel 
Prize in Chemistry, for example, for develop- 
ing a super-resolution fluorescent microscope, 


Relative subject area distribution 


Both Canada and the United States focus on the life sciences, 
generating more than 40% of their outputs in this area.* 


Life sciences 


Earth and environmental 


HB United States 
i Canada 


Physical sciences 


*Each slice represents the proportion each subject area contributes to a country's 
overall WFC. Subject areas can overlap, so the total percentage may exceed 100%. 
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went to Eric Betzig of Howard Hughes Medical 
Institute and William Moerner of Stanford Uni- 
versity, along with Stefan Hell from Germany’s 
Max Planck Society. 

Even as its scientists produce award-winning 
results, US research spending continues to be 
curtailed, with steady erosion in federal fund- 
ing for science since 2004, according to Matt 
Hourihan, director of the R&D Budget and 
Policy Program for the American Associa- 
tion for the Advancement of Science. Between 
2010 and 2015, non-defence spending 
for research dropped by 5%, adjusted 
for inflation. And although President 
Barack Obama has proposed a 7% rise 
in spending for 2016, “that is a much 
bigger increase than Congress is likely 
to grant this year,’ Hourihan says. 

Since the federal sequester cuts in 
March 2013, which imposed compre- 
hensive reductions in federal spend- 
ing, there's been a modest recovery 
in such expenditures, Hourihan says. 
He adds that spending is expected to 
grow only moderately until 2021. “The 
pool of funding from which science 
can draw is going to remain very tight 
for the next few years, unless Congress 
acts to change it,” he says. 

The NIH has had a turbulent cou- 
ple of decades financially. Its budget 
doubled between 1998 and 2003 but 
has dropped by a fifth since then — 
though it bounced back from a 2013 
low. With a little more money in its 
budget, the percentage of grant applica- 
tions the agency approved rose some- 
what to 18.1% in 2014 from 16.8% in 
2013, according to NIH deputy director for 
extramural research Sally Rockey, although that 
doesn’t compare to rates above 30% in the late 
1990s and early 2000s. “We're leaving great stuff 
on the cutting room floor, because we have to 
prioritize,’ adds NIH principal deputy director 
Lawrence Tabak. 

The long-term consequences of this budget 


uncertainty are worrying, warns Shirley Tilgh- 
man, former president of Princeton University 
and a past participant in NIH advisory groups. 
The growth in NIH funding until 2003 encour- 
aged many young people to enter the biomedi- 
cal field, she says, but now there’s not enough 
funding to support their research careers. 
“There are too many people chasing too little 
money in the US,’ she says. 

Tilghman, along with three other luminar- 
ies — Bruce Alberts, former president of the 


Parallelized RESOLFT nanoscopy reveals the protein keratin in living 
cancer cells in amazing detail. 


National Academy of Sciences, Marc Kirsch- 
ner of the Harvard Medical School, and former 
director of the National Cancer Institute Harold 
Varmus — wrote a perspective in the Proceed- 
ings of the National Academy of Sciences last year 
calling on the country to revamp its biomedical 
research system. Tilghman argues in the paper 
that, for some years, graduate students and 
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Top 5 institutions’ relative subject area distribution 
Harvard University, the region's leader by WFC, publishes 
about 70% of its output in the Index in life sciences.* 


BB Harvard Life sciences 
University 
i Stanford 
University 
IB Massachusetts 
Institute of Technology 
OB University of 
California Berkeley 
© National Institutes 
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Earth and environmental 


*Each slice represents the proportion each subject area contributes to an institution's 
overall WFC. Subject areas can overlap, so the total percentage may exceed 100%. 
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postdocs have been treated as a source of cheap 
labour for biomedical research, Tilghman says 
many of them don't manage to start their own 
lab until their late 30s. And the competition for 
scarce funding can make scientists more con- 
servative in their thinking, she says, because 
they want to give funding agencies what they're 
expecting rather than propose a new idea whose 
outcome is uncertain. “Over time, the quality of 
science is going to go down because people are 
too scared to take big risks,” she says. 


CANADA'S EYE ON COMPETITION 
Canada’s WFC (1,489) is essentially 
unchanged from last year. That main- 
tains its place as the country with the 
7th largest WFC in the Index, way 
above its 37th place ranking in terms 
of population, according to data from 
the United Nations. Like the United 
States, Canada’s output in the Nature 
Index is strongest in the life sciences, 
which accounts for about 40% of the 
country’s total WFC. The nation’s 
WEC of 33 in either Nature or Sci- 
ence is 2% of its WFC in 2014. Yet in 
Canada too there are worries about 
the nation’s future competitiveness 
in the global research landscape, says 
Paul Dufour, an adjunct professor at 
the University of Ottawa's Institute for 
Science, Society, and Policy. “The state 
of science and technology innovation 
in this country is in decline,” he says. 
Among his concerns isa fall in research 
investment. According to the Organiza- 
tion for Economic Co-Operation and 
Development (OECD), Canada’s gross 
domestic expenditure on research and devel- 
opment dropped as a percentage of GDP from 
2.0% in 2001 to 1.6% in 2013, which is below the 
OECD average of 2.4%. “We're one of the lowest 
in the OECD and it’s still dropping, so it’s not a 
good picture over all? Dufour says. 

Canada weathered the recession of 2008- 
2009 fairly well due to a strong banking system, 
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Top 5 institutions’ collaborativeness 
Over half of the work from researchers at the top institutions in North America 
stems from collaborative efforts with domestic and global counterparts.* 
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*Each bar represents the proportion of an institution's overall output in the 


index (AC) stemming from domestic and international collaborations. 
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Dufour says. Nevertheless, the Canadian gov- 
ernment has emphasized austerity and has cut 
spending in all areas. The country’s national 
statistical agency, Statistics Canada, estimated 
federal spending on science and technology 
would decline 5.4% in the fiscal year 2014-2015 
to about US$8.5 billion, continuing a down- 
ward slide since it peaked in 2010-2011. Prime 
Minister Stephen Harper's office, however, pre- 
fers to focus on a total of about US$8.75 billion 
in new investments in science, technology and 
innovation since 2006. Its priority areas, besides 
health and life sciences, are environment and 
agriculture, energy and natural resources, 
information and communications technolo- 
gies, and advanced manufacturing, ranging 
from aerospace and robotics to three-dimen- 
sional printing and nanotechnology. 

One major government initiative is the 
Canadian High Arctic Research Station, 
costing around US$113 million and billed to 
open in 2017. The country hopes that the sta- 
tion — above the Arctic Circle in Cambridge 
Bay, Nunavut — will attract international sci- 
entists and strengthen Canada’s position in 
polar research. 

In 2014, the government created the Canada 
First Research Excellence Fund. Starting with 
almost US$40 million this year, the fund pro- 
poses to provide about US$1.2 billion over the 
next decade to fund university research that 
creates “long-term economic advantages for 
Canada’, according to its mission statement. 
This funding effort has been controversial, 
however. In a December 2014 editorial, the 
Toronto Star suggested, “It’s a significant 
investment fully undermined by a disastrous 
caveat: to access the funding, applicants will 
have to show that they have private-sector 
co-funders.” Such a focus on research with 
commercial applications means sacrificing 
the basic research that leads to future innova- 
tions, undermining Canadian science in the 
long run, the paper argued. And this is not 
the only sign that Canada’s government wants 
closer ties between research and industry. The 


This Canadian icebreaker takes scientists to study 
the polar environment, and in 2017 the country’s 
High Arctic Research Station should open. 


previous year, Canada’s National Research 
Council had its mission refocused to support 
research that would help Canadian business be 
more competitive. 


“YOU MUST DEMONSTRATE 
THAT YOU HAVE A WELL- 
KNOWN OR WELL-DEFINED 
MARKET OUTCOME.” 


“You have to demonstrate that you have a 
well-known or well defined market outcome,’ 
Dufour says of the new emphasis. “Some of the 
grass roots science community are quite vocal 
on the problems this may create.” For example, 
Evidence for Democracy is a non-profit group 
that formed in response to funding cuts, and is 
urging the government to return to a focus on 
more basic science. 


TOP SCHOOL IMPACTS 
US universities are well represented among 
the world’s leading research institutions in the 
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Index, with Harvard, Stanford, the Massachu- 
setts Institute of Technology (MIT) and the Uni- 
versity of California, Berkeley all among the top 
10 institutions by WFC. Canada’s top school, the 
University of Toronto, is 21st. 

Top US institution Harvard University, with a 
WFC of 865, has the second largest output over- 
all in the Index — behind the Chinese Academy 
of Sciences, which has a WFC of 1,308. While 
the Chinese Academy’s output improved by 8% 
over 2013, Harvard was up only 1.4% on the pre- 
vious year. Nevertheless, this was in the face of 
declining income: Harvard's sponsored research 
budget dropped to US$812 million in 2014, 
down US$9 million from the year before. The 
decline was due to a 5% cut in federal awards, 
to US$605 million, though that was partly off- 
set by a 12% rise in contributions from founda- 
tions and businesses. The university has about 
2,400 faculty members, plus 10,400 academic 
appointments in its affiliated teaching hospitals. 
There are approximately 6,700 undergraduates 
and 14,500 graduate and professional students. 

The school’s greatest strength is in the life 
sciences, which accounted for nearly three- 
quarters of its output. In fact, it’s the Index’s 
world leader in that field, with a WFC of 610, 
nearly double the 320 of the second-placed 
NIH. Harvard's life sciences output is boosted 
through its affiliation with more than a dozen 
leading research and teaching hospitals in the 
Boston area, including the Dana-Farber Can- 
cer Institute and the Joslin Diabetes Center, as 
well as with independent research organizations 
such as the Whitehead Institute for Biomedi- 
cal Research. For instance, researchers at the 
Broad Institute of MIT and Harvard contrib- 
uted to Harvard's total with a December paper 
in Nature Genetics, “Comprehensive Variation 
Discovery in Single Human Genomes.” 

Harvard's WFC of 75 is also the highest of 
contributions to Nature and Science. The uni- 
versity is also in the top 10 for output in both 
physical sciences and chemistry, but comes in 
at 35th for earth and environmental sciences. 

Stanford, with the second-highest output in 
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39, 1% 1% 


4% 


OLD-WORLD 
CONNECTION 


46% 


S6 | NATURE INDEX 2015 | GLOBAL 


Top 3 country collaborations 
When Canadian scientists collaborate outside the country, it is 
with researchers from North America nearly 40% of the time.* 
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*Bars represent international collaboration rates between countries as a proportion of 
the total output (WFC) stemming from international collaborative efforts. 
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Stanford’s SLAC relies on 284 microwave pulse 
generators in its particle accelerator, which is 
more than three kilometres long. 


North America and fifth globally, had a WFC 
of 539, a drop of 2% from 2013. The school’s 
sponsored research budget was US$1.33 bil- 
lion, down US$21 million from the previous 
year. Approximately 82% of that support comes 
from the federal government, which actually 
increased its funding for most of the univer- 
sity’s research; the major drop was caused by a 
$37 million cut in funds for the SLAC National 
Accelerator Laboratory. 

Despite its larger budget — mostly attribut- 
able to SLAC — Stanford is similarly sized to 
Harvard, with 2,118 faculty members in 2015, 


7,018 undergraduates and 9,118 graduate 
students. At Stanford, life sciences and physi- 
cal sciences each represent about 40% of its 
contributions, with chemistry covering about 
30% and earth and environmental science 5%. 
(Nature Index assigns some articles to more 
than one field of science, so the totals don’t 
come to 100%.) Stanford contributes heavily 
to the Journal of High Energy Physics, in part 
because of its work on the ATLAS detector at 
the Large Hadron Collider (LHC) in Switzer- 
land. This journal published upwards of two 
dozen Stanford-authored papers on ATLAS- 
related work in 2014. But, despite its funding 
cut, SLAC shines as well. Ina November paper 
in Nature, “High-Efficiency Acceleration ofan 
Electron Beam in a Plasma Wakefield Accel- 
erator,’ SLAC researchers showed how they 
might build a cheaper, more compact particle 
accelerator than the kilometres-long LHC. The 
paper garnered quite a bit of attention in news 
media, blogs and Twitter, according to Altmet- 
ric, which looks at the social-media impact of 
journal papers. 

With the third highest WFC in the United 
States and sixth globally, MIT also made a 
splash in particle physics at the LHC. More than 
50 MIT researchers and students were involved 
in a different LHC instrument, the compact 
muon solenoid, and helped confirm the dis- 
covery of the long-sought Higgs boson, which 
underpins why objects have mass. A June 22, 
2014 paper, “Evidence for the Direct Decay of 
the 125 GeV Higgs Boson to Fermions,’ attrib- 
uted to hundreds of researchers, including 31 
at MIT, scored higher for public attention than 
any other paper ever in Nature Physics, accord- 
ing to Altmetric data. 

MIT distributes its contributions fairly 
evenly — 42% in physical sciences, 37% in life 
sciences, and 34% in chemistry, though only 
7% in earth and environmental sciences. That 
makes it much stronger than the country as 
a whole in physical sciences and chemistry. 
Its sponsored research budget of US$1.28 bil- 
lion includes US$739 million in Department 
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Contributions to Nature and Science 
Harvard University surpasses the global level by more than 
2.5 times in this metric of high-profile publishing.* 
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*Each bar represents an institution's or country's relative contribution to publications in 
Nature and Science, given as a proportion of its overall WFC. 
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of Defense funds for its defense research arm, 
Lincoln Laboratory. 

The University of Toronto — with about 
7,000 full-time faculty, 2,700 postdocs and 
research associates, 15,000 graduate students 
and a science budget of almost US$1 billion 
— is Canada’s largest research institution. Its 
WEC for 2014 rose 5% to 269.The country has 
two other schools in the top 100: McGill at 68 
and the University of British Columbia at 70. 
Like Harvard, the University of Toronto has 
an affiliated network of several local hospitals, 
and its largest contribution is in life sciences, 
where it comes 9th worldwide. 


SHARING SCIENCE, GOING BIG 

Sharing the cost of research and working across 
disciplines is becoming common, as expansive 
and expensive projects such as the LHC draw 
on resources. “There's generally more collabo- 
ration, because of technology, because science 
is so complex,” says the NSF’s Hill. 

The NIH announced two large initiatives in 
2014, both of which will bring together large 
numbers of people from different institutions. 
The Accelerating Medicines Partnership, which 
aims to develop new treatments for Alzheimer’s 
disease, diabetes and autoimmune disorders, 
has a budget of US$230 million over five years 
and will entail partnerships between NIH, 
pharmaceutical companies and non-profit 
groups. The Brain Research through Advanc- 
ing Innovative Neurotechnologies (BRAIN) 
initiative seeks to derive a more complete 
understanding of the human brain. It received 
US$46 million in 2014 to start a 12-year pro- 
ject that will involve more than 100 researchers 
from several countries. 

Despite the promise of these bold projects, 
the NIH’s Rockey still has worries over con- 
tinued budgetary constraints. “At some point 
there's going to be diminishing returns, simply 
because we dont have the funds to keep up,” she 
says. “We have every reason to be optimistic 
about the future scientifically. We just have to 
get our house in order.” m 


© 2015 Macmillan Publishers Limited. All rights reserved 


Social impact 
Not surprisingly, the world's number one Altmetric score in 
2014 came from an article about social networking.* 


[173 news outlets 
()}115 blogs 
13,313 tweets 
1 peer review site 
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HB 3 Wikipedia pages 
[BB 115 Google+ users 
13 Redditors 
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*Altmetric score correct as of 21 May 2015. Experimental evidence of massive-scale emotional contagion 
through social networks. Proceedings of the National Academy of Sciences. DOI: 10.1073/pnas.1320040111. 
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North & West Europe 


Strong spending and recruitment keep countries in this region 
near the top of the world’s output in the Index, and scientists 
from universities collaborate more with their neighbours. 


ARTICLE COUNT (AC): 25,137 
FRACTIONAL COUNT (FC): 18,382 
WEIGHTED FRACTIONAL COUNT (WFC): 15,785 


orth & West Europe accounted for 
Ne 30% of the publications in the 

Nature Index based on the weighted 
fractional count (WFC), second only to North 
America. Moreover, this region scored highly 
on the Index’s WFC in all four categories: first 
in physical sciences, second in life sciences and 
earth and environmental sciences, and third 
in chemistry. 

Germany (number 3 in the global Index) put 
in the region’s strongest performance, edging 
the runner-up United Kingdom (number 4 
globally) by nearly 25% according to WFC. In 
addition, Germany ran away with the region's 
leading WFCs in chemistry and physical sci- 
ences, topping the second-placed United King- 
dom by about 40% in the former and 60% in 
the latter. But the UK led the region in the life 
sciences, according to the Index, with a WFC 
about 10% higher than Germany. And both the 
UK and France surpassed Germany in earth 
and environmental sciences. 


PURCHASING PRESTIGE 

Germany’s scientific strength is cemented by 
consistently high research spending — more 
than US$100 billion annually and the world’s 
fourth highest. Data from the Organization 
for Economic Co-operation and Development 
(OECD) show Germany’s public and private 
research spending as 2.9% of GDP in 2013, 
which is a full percentage point higher than 


the European average. “We are aware that this 
is not the case in all European countries, and 
this is a clear competitive advantage not only in 
Europe but also in comparison to the US,” says 
Rolf Zettl, managing director of the Helmholtz 
Association, Germany’s largest publicly funded 
scientific research organization. 


“THEY LIKE TO HAVE THAT 
EUROPEAN FUNDING 
EVEN IF IT COMES WITHA 
LOT OF RED TAPE.” 


Several European countries increased pub- 
lic research spending during the financial 
downturn that began in 2007. “By keeping up 
the activity level they helped mitigate the fall 
in business R&D expenditure,” says Gernot 
Hutschenreiter, head of country innovation 
policy reviews at the OECD. However, he adds, 
“this buffering is now over and public expendi- 
ture is either stable or falling.” Even the Ger- 
man government's pledged 3% annual increases 
from 2016 until 2020 are a reduction from the 
current 5%, 

In the UK, R&D spending was only 1.6% 
of GDP in 2013 — considerably lower than 
Germany and also France, which spent 2.2%. 
Alex Halliday, vice-president of the UR’s Royal 
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Countries’ weighted fractional count (WFC) 


Germany and the United Kingdom accounted for 46% of this 


region's output, and France made up another 15%. 
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Society, says, “I think government does under- 
stand that we have to spend more money in the 
future, but it’s quite a hard thing to do at the 
same time as you are having to make massive 
cuts elsewhere.” Still, the quality of UK science 
remains high and it leads the region in WFC 
for papers published in Nature and Science with 
128, or 3.9% of its total WFC — compared to 
the global aggregate of 3.1%. 

In part, Halliday attributes the UK’s high- 
standard papers in the Index to its intensely 
competitive research auditing system, the 
Research Excellence Framework (REF), most 
recently published in 2014 and used to deter- 
mine each university's funding allocation. Hal- 
liday, who returned to the United Kingdom after 
working in Switzerland and the United States, 
says, “I gradually became a convert because I 
began to see what it was achieving for the UK 
that other countries didn't have.” He adds, “It has 
forced universities to think about how to build 
research strength in a very competitive way.’ 
UK excellence is often hampered by an ageing 
university infrastructure, and cannot compare 
to the kind of facilities available at the top uni- 
versities in Switzerland, Halliday says. 

Switzerland (4th in the region by WFC and 
8th globally) spends around 2.9% of GDP on 
research, and generally outperforms relative to 
its small size; its 2014 WEC for Nature and Sci- 
ence papers was 44. The Netherlands (6th in the 
region, 14th globally) spent a lower percentage, 


Relative subject area distribution 


Most of the top producers in this region published similar outputs 


in all categories except earth and environmental sciences.* 
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Scientists supported by the French National Centre for Scientific Research (CNRS) explore subatomic physics. 


1.98% of it GDP in 2013, but its WFC of 29 in 
Nature and Science ranks it in the top five. The 
country has concentrated funding on a small 
number of talented individuals, says molecular 
geneticist Hans Clevers, president of the Royal 
Netherlands Academy of Arts and Sciences. 
“There is a lot of emphasis on publication in 
top journals, but there are ongoing discussions 
asking if the country is emphasising the top 5% 
talent too much,’ he says. 

According to the Index, scientists in North 
& West Europe collaborate more with their 
immediate neighbours than any other region; 
51% of all their international collaborations 
occurred within the region in 2014. When 
they do team up with researchers from further 
afield, it is usually with scientists in the United 
States. European Union (EU) funding partly 
explains the high collaboration level within 
the region. Although the EU research budget is 
only 5-7% of the total national budgets, Katrien 
Maes, chief policy officer of the League of Euro- 
pean Research Universities, says universities 
see EU projects as a prestigious supplement to 
national funding. “They like to have that EU 
funding even ifit is difficult to get and comes 
with a lot of red tape,’ she says. 


INSTITUTIONAL IMPACTS 

By WEC rankings, all three top institutions in 
the Index for this region are national research 
organizations. The French National Centre for 


Scientific Research (CNRS) takes the region’s 
top spot. Its overall WFC in the Nature Index is 
762, which is third globally — behind the Chi- 
nese Academy of Sciences at 1,308 and Harvard 
University at 865. Regionally, Germany's Max 
Planck Society and the Helmholtz Association 
of German Research Centres follow CNRS. 


“MAX PLANCK LOOKS 
FOR THE SMARTEST 
SCIENTISTS AND PROVIDES 
THEM WITH PARADISE.” 


CNRS employs more than 11,000 research 
staff with civil servant status and runs 10 
fully funded labs. The 2014 annual budget of 
more than US$3.5 billion represents a quarter 
of French public civilian research spending. 
Publications skew toward the physical sci- 
ences, with 41% in this field, compared to the 
global aggregate of about 35%. A 2014 OECD 
report called into question the “stultifying pre- 
eminence” of the CNRS in French research and 
labelled it a major handicap. The report noted 
slow reform of a bureaucratic system. “France 
is still in a transition and there is a kind of 
hybrid model now, which is very complex, and 
this creates difficulties,’ says Hutschenreiter. 
Funding pressures are also an issue, and French 
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scientists protested a 1% cut to government's 
2015 research budget. 

Nearly 30% of Germany’s overall WFC comes 
from papers published by scientists at the Max 
Planck Society and Helmholtz Association. 
The centres are run independently, with full 
scientific autonomy. The Max Planck Society 
consists of 83 facilities and in 2014 employeda 
total of 5,516 scientists, with an overall budget 
of more than US$1.75 billion, coming jointly 
from federal and state budgets. Since its found- 
ing in 1948, the MPS has produced 18 Nobel 
Prize winners and this reflects the society’s 
ethos of researcher-centred excellence. “The 
Max Planck position is to look for the smartest 
and brightest scientists worldwide and provide 
them with paradise,” says Zettl. The Max Planck 
has 2,950 papers in the Nature Index, which 
represents at least half a paper per scientist. It 
shows particular strengths in physical sciences, 
with 46% of all its overall WFC in the Index in 
this area and 4.7% of its publications in Nature 
or Science — considerably higher than the 2.6% 
for the CNRS. 

The newer and lesser-known Helmholtz 
Association was founded in 2001 from a col- 
lection of 18 independent research centres. 
The Helmholtz association has a WFC of 440, 
with 49% in the physical sciences — perhaps to 
be expected given its inclusion of Germany’s 
Deutsches Elektronen-Synchrotron (DESY) 
located in Hamburg and Zeuthen, near Berlin. 
As well as exploring particle physics, the facili- 
ties three large particle accelerators provide 
extremely powerful X-ray radiation used to 
study new materials and biomolecular pro- 
cesses. Helmholtz is also strong in earth and 
environmental sciences: 16% of its WFC is in 
these fields, almost double the global aggregate. 

The Helmholtz Association provides 
programme-centred strategic research on 
pressing problems in energy, the environment, 
health and transport. Its nearly US$3 billion 
2015 budget comes mainly from the federal 
government with a 10% contribution from 
local states, plus more than US$1 billion from 


Top 5 institutions’ relative subject area distribution 
The region's top institution, the French Centre for Scientific Research, 
published 77% of its WFC in chemistry and physical sciences.* 
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Top 5 institutions’ collaborativeness 
The percentage of collaboration by this region's 
leading institutions resembled the order of output.* 


Collaborations (%) 


*Each bar represents the proportion of an institution's overall output in the 
index (AC) stemming from domestic and international collaborations. 
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additional research grants from other sources, 
such as EU programmes. 


UK UNIVERSITIES EXCEL 

For the rest of the region’s top-10 institutions, 
the UK’s university sector dominates, includ- 
ing the University of Cambridge, the University 
of Oxford, Imperial College London and Uni- 
versity College London (UCL). Switzerland's 
top universities, the Swiss Federal Institutes of 
Technology in Zurich (ETH Zurich) and Laus- 
anne (EPFL), take the other spots. Maes says 
the strength of these institutions is linked to 
their autonomy, particularly in relation to aca- 
demic recruitment. “Research is a global com- 
petition and our universities are in the search 
for talent,’ she says. 

Cambridge and Oxford are the oldest univer- 
sities in the English-speaking world, established 
in 1209 and 1096, respectively. Based on the 
WEC ofall institutions globally, Cambridge is 
ranked 9th in the Index and Oxford 11th, with 
WECs of 403 and 356. Papers published in 
Nature and Science account for 5.0% of Oxford’s 
WEC and 4.5% of Cambridge’s — both signifi- 
cantly higher than the global aggregate of 3.1%. 

The subject spread of Cambridge publications 
is close to the global aggregate with a slightly 
stronger performance in the physical sciences. 
One 2014 article in the Index came from Cam- 
bridge materials scientist, Jason Robinson, and 
colleagues. This article in Nature Communica- 
tions describes a breakthrough in the field of 
spintronics, a technology that could help create 
super-fast computers, processing data using the 
electron property ‘spir’ rather than charge. The 
paper provides the first evidence that supercon- 
ducting materials, which carry current without 
losing energy, can be made into data process- 
ing devices using a layered magnetic material 
containing the element holmium. Robinson 
says one of Cambridge's strengths is the sense 
ofa collective way of working. He says, “When 
I first arrived there wasn't as much interaction 
between departments, but now I feel there is a 
lot of collaboration going on” 


The University of Cambridge ranked ninth among 
the world’s top institutions in the Nature Index. 


“RESEARCH IS A GLOBAL 
COMPETITION AND OUR 
UNIVERSITIES ARE IN THE 
SEARCH FOR TALENT.” 


Oxford shows a stronger performance in the 
life sciences, with 43% of its WFC in this area, 
compared to a regional aggregate of the WFC 
of about 30%. Overall the university has the 
highest research income of any UK university. 
It has been particularly successful in bringing 
in funding for new research centres, particu- 
larly at the interface between physical sciences 
and biomedicine, such as Oxford’s Institute for 
Biomedical Engineering, where engineers and 
clinicians work together to develop medical 
devices and technologies in areas including 
medical imaging and regenerative medicine. 
“That interface is going to generate new tools, 
new devices, new technology, new drugs, and 
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International collaborations 
When scientists in this region collaborate, it is usually with 
scientists from other countries in the region. 


that’s where we need to invest in the future,” 
says Halliday. Oxford is also strong in bioin- 
formatics and statistical genetics, illustrated 
by a Nature Communication in the Index, from 
alarge interdisciplinary team led by Ian Tom- 
linson from Oxford’s Wellcome Trust Centre 
for Human Genetics. The communication 
reports the whole genome sequencing of blad- 
der cancers and uncovered genetic mutations 
that provide potential drug targets as well as 
markers for diagnosis. These results are part 
of a wider genomics collaboration, with the 
technology company Illumina, studying the 
genomes of 500 patients with hard-to-treat 
diseases. 

Unlike most Swiss universities top institu- 
tion ETH Zurich receives 80% of its budget 
of more than US$1.6 billion from the federal 
government to support 5,000 scientific staff. 
ETH shows strength in physical science and 
chemistry, which make up about 40% and 
37% of all publications. Although only 22% 
of its 2014 WEC was in life sciences, that is an 
increase from the 2013 level of 16%. In 2012 
ETH opened a new department of health sci- 
ence and technology, which takes a systemic 
and multidisciplinary approach to health, from 
the molecular scale to social context. 

ETH’s strength in chemistry is not surpris- 
ing given the county’s Basel-based pharma- 
ceutical industry, in particular, Roche, which 
is eighth in the region’s institutions publish- 
ing in Nature and Science. In 2014, Roche 
outspent all other pharmaceutical companies 
with a global R&D budget of US$9 billion, and 
is the only company represented in the top-10 
of any region. The company has many collab- 
orations with leading academic institutions, 
and it “strongly encourages our scientists to 
share their findings in both peer-reviewed 
publications and presentations at congresses,” 
says John Reed, head of pharma research and 
early development. 

This region’s educational and research insti- 
tutes and industry promise a strong output of 
scientific research for years to come. m 


Top 3 country collaborations 


When collaborating outside the region, scientists largely work 
with colleagues from the United States.* 
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*Bars represent international collaboration rates between countries as a proportion of 
the total output (WFC) stemming from international collaborative efforts. 
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East & Southeast Asia 


While remaining strong in chemistry and physical sciences, 
buoyed by China’s growing contribution, the region 
embarks on collaborations that could extend its reach. 


ARTICLE COUNT (AC): 16,277 
FRACTIONAL COUNT (FC): 12,092 
WEIGHTED FRACTIONAL COUNT (WFC): 11,464 


ast & Southeast Asia was the third larg- 
B= regional contributor to the Index 

behind North America and North & 
West Europe. Its 2014 WFC of 11,464 is a 
6% increase from 2013. China, the second- 
ranked country in the global Index, increased 
its WFC by 16% over 2013. In comparison, 
North America’s WFC decreased by 4% and 
North & West Europe’s dropped, too, but by 
less than 1%. 

“China is growing much, much faster than 
the US,” says Derek Hill, a science resources 
analyst at the National Science Foundation 
(NSF). “And China has been investing enor- 
mous resources in education and research and 
development” 

Overall, the scientific output from East & 
Southeast Asia reflects the economic status 
of the region. The combined GDP of these 
countries makes up about 22% of the global 
GDP. Likewise, this region accounts for 22% 
of the Nature Index’s weighted fractional 
count (WFC) for 2014. For comparison, 
North America accounts for about 24% of the 
world’s GDP, yet it produces 38% of the Index’s 
total WFC. 

One area in which East & Southeast Asia 
dominates the Index is the output of chemistry. 
The region’s chemistry WFC in 2014 of 6,062 
makes up one-third of the global output in this 
field. Physical sciences represent the region's 
second highest area of publishing output in the 


In Nanjing, China, BASF and Sinopec collaborate to increase the production of agricultural chemicals. 


Index, but only reach about the global aggre- 
gate in terms of the percentage of the region's 
total WFC. The region’s contributions to the 
Index in both life sciences and earth and envi- 
ronmental sciences are about half the equiva- 
lent proportions that those areas contribute to 
the global aggregate WFC. Among articles in 
the 2015 Index, scientists in East & Southeast 
Asia collaborate with scientists in other coun- 
tries in about 25% of projects. 
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Countries’ weighted fractional count (WFC) 


In combination, China and Japan produced more than 80% 


of this region's contribution to the Index. 
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In earth and environmental sciences, 
though, the scientists in the region collaborate 
80% of the time. All of the regions in the Index 
collaborate most in this subject category. 


THE TRADITIONAL TOP THREE 

China dominates East & Southeast Asia’s 
Index performance: its WFC of 6,037 accounts 
for exactly half of the region’s publishing 
output. Second-placed Japan, with a WFC 


Relative subject area distribution 
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of 3,200, makes up another quarter. Third- 
placed South Korea was responsible for 10% 
of the region’s WFC. Contributions from 
other countries split up the remaining 15%, 
with none of them providing even 5% of the 
output in the Index. 

The top three country contributors to the 
Index from this region are very strong in 
high-quality chemistry and physical sciences 
research. Chemistry accounts for a whopping 
60.3% of China’s WFC; the global aggregate 
is 35.6%. 

China's research in chemistry covers applied 
and basic science. In the March Journal of the 
American Chemical Society, for example, sci- 
entists from China’s Wuhan University and 
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Central China Normal University described 
the synthesis and crystal structure of a non- 
linear optical material that resists damage 
from lasers. The Index articles for 2014 also 
include a January paper in Inorganic Chem- 
istry in which scientists from China's Jiangsu 
University revealed new magnetic properties 
of iron compounds. 

In addition to extensive publishing on 
chemistry in 2014 data in the Index, US-based 
IHS Chemical, which offers industry market- 
ing information and technical expertise, says 
that Asia — especially China — “will remain 
the dominant global force for chemicals 
beyond 2020” 

Similar research concentrations exist in 
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other leading countries in the region. Of Japan's 
WEC from 2014 articles in the Index, 41% and 
35% come from chemistry and physical sci- 
ences. For South Korea, chemistry and physi- 
cal sciences provide 51% and 46% of its WFC. 

When scientists in East & Southeast Asia 
collaborate internationally, they tend to look 
to the west. In fact, 45% of collaborations 
are with scientists from North America. The 
second most likely source of collaborators, 
accounting for 28% of them, is North & West 
Europe. Other scientists from the same region 
only make up 18% of international collabora- 
tions, and other parts of Asia add only a few 
percent of the collaborators. 


“CHINA WILL REMAIN 
THE DOMINANT GLOBAL 
FORCE FOR CHEMICALS 

BEYOND 2020.” 


CUTTING-EDGE INSTITUTIONS 

By far the region’s top institutional contributor 
to the Index is the massive Chinese Academy 
of Science (CAS), which takes the number one 
spot in the world. Its WFC of 1,308 — up by 
8.1% since 2013 — is more than 50% higher 
than second-placed Harvard’s and accounts 
for 2.6% of the global output. Like China in 
general, CAS published strongly in chemistry 
and physical sciences, with WFCs of 771 and 
413 leading the world in these disciplines. One 
example of CAS’s physical sciences prowess 
can be found in a 2014 Nature article by two 
CAS researchers and 11 other authors. The 
paper shows that single-atom layers of gra- 
phene, as well as of hexagonal boron nitride, 
have unexpectedly high permeability to pro- 
tons at room temperature. This permeability 
makes such monolayer crystals “promising 
candidates for use in many hydrogen-based 
technologies,’ the authors conclude. 


Top 5 institutions’ relative subject area distribution 
Chemistry articles made up more than half of this region's output, 
and the Chinese Academy of Sciences led this category globally.* 
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Top 5 institutions’ collaborativeness 
The Chinese Academy of Sciences published the most 
but collaborated the least among the leading institutions.* 
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“Each bar represents the proportion of an institution's overall output in the 
Index (AC) stemming from domestic and international collaborations. 
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Scientists at the Chinese Academy of Science, which has the highest institutional WFC in the Index, 
explore basic and applied science, including clinical research. 


Japan’s University of Tokyo came in second 
amongst the region’s institutions. Its WFC of 
456 placed it at number 7 globally. Like CAS, 
this university published largely on chemistry 
and physical sciences, with respective WFCs 
of 144 and 205. 

Relatively speaking, this university puts 
more emphasis on the life sciences than CAS, 
with a WEC of 99. In addition, about 8% of 
the University of Tokyo's WFC goes to articles 
in earth and environmental sciences, which is 
about the proportion of the global aggregate 
WFC in this area. 

For example, University of Tokyo earth 
scientist Masumoto Yukio and his colleagues 
reported in Nature in 2014 on the impact 


of global warming in currents in the Indian 
Ocean. They noted that ongoing changes will 
continue to drive more extreme weather. 

The region’s number three institution, Chi- 
nas Peking University, generated a 2014 WFC 
of 292, and came in 17th globally. Like CAS 
and the University of Tokyo, Peking University 
focuses on chemistry and physical sciences. 

In addition, this university relies on col- 
laborations to expand its strengths in other 
areas. For example, Peking University Clini- 
cal Research Institute (PUCRI) maintains an 
ongoing collaboration with New York-based 
Medidata, which provides cloud-based, life 
science solutions. 

Many companies in China, who still keep 
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track of clinical trials on paper, will be able to 
utilize Medidata’s electronic systems. Takeru 
Yamamoto, Medidata’s managing director of 
Asia-Pacific region, says, “Our goal remains to 
enhance the speed and operational efficiencies 
of clinical studies, so that we can enable life 
science companies to bring new therapies to 
patients more quickly — both from the local 
market and around the world.” 


“OUR GOAL REMAINS 
TO ENHANCE THE SPEED 
AND EFFICIENCIES OF 
CLINICAL STUDIES.” 


SPREADING EXPLORATIONS 

China contributed 147 articles in Nature and 
Science in 2014 — the most for the region, just 
ahead of Japan with 137. Some of China’s out- 
put that landed in these high-profile publica- 
tions came from areas outside its typical focus 
on chemistry and physical sciences. 

In the October Nature, for example, a group 
of researchers from various institutions in 
China and one in the United States, reported 
on the best methods of crop management to 
produce more food with fewer environmen- 
tal consequences. In addition, some of these 
articles combine China’s strength in chemistry 
with other areas. As an example, a July Nature 
article from researchers at the Chinese Acad- 
emy of Sciences and the University of Science 
and Technology of China explored the bio- 
chemistry of lipids in search of clinical targets 
for fighting bacterial infection. 

These researchers, as well as others across 
East & Southeast Asia, are thus using their 
foundational strength in chemical sciences to 
study not only that area but a host of related 
ones. Such work is fuelling the remarkably 
robust surge in the region’s scientific research, 
as reflected in a rising WFC in the Index. m 


International collaborations 
Nearly half of the region's collaborators work in North America, 
and more than a quarter come from North & West Europe. 
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Top 3 country collaborations 
In China, Japan, South Korea and Taiwan, papers with US coauthors 
account for more than 30% of such international collaborations’ WFC.* 
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Central & East Europe 


Social change and economic stagnation create challenges 
for this region’s biggest countries, but some of the smaller 


ones are shining. 


ARTICLE COUNT (AC): 3,275 
FRACTIONAL COUNT (FC): 1,377 
WEIGHTED FRACTIONAL COUNT (WFC): 1,091 


( hange and upheaval were the two main 

forces at work in this region in 2014. 

To the south, Greek scientists hope for 
positive outcomes from the election of the new 
anti-austerity government. To the east, Russia's 
science is still in turmoil in the wake of eco- 
nomic sanctions, while its funding structure 
is undergoing reform. Despite this, the coun- 
try still led the region in 2014 as measured by 
the Nature Index’s weighted fractional count 
(WFC). Overall, as with last year’s Index, the 
regional output is heavily dominated by the 
physical sciences, reflecting the emphasis placed 
on military and applied science by the former 
communist regimes, and the effects of ideology 
on life science research. 

Maintaining second place in the region and 
24th globally by WFC is Poland, the 7th larg- 
est economy in the European Union (EU), and 
one of its fastest growing. Poland comes second 
in the region (behind Russia) in physical sci- 
ences and chemistry WFCs and second in life 
sciences (after Hungary). Like other countries 
in the region, Poland also scores highly on col- 
laborativeness; with 60% of its output coming 
from international collaborative efforts it far 
surpasses the United States (26%), Germany 
(47%) and the United Kingdom (50%) in form- 
ing international collaborations. 

Poland is still working to reform its academic 
and research institutions, and to increase the 
economic impact of its scientific efforts. To 


these ends, three key policy developments have 
taken place in the past year, says Maciej Zylicz, 
president of the not-for-profit Foundation for 
Polish Science. The first is a drive to improve 
research quality, by changing how institutions 
will be assessed. “The system is very flat at the 
moment,’ says Zylicz. “Poland would like to give 
more money to the best institutions.” As of Janu- 
ary 2015, the government is making its funding 


“UNIVERSITIES CANNOT 
COVER ALL NEEDS 
WITHOUT WORLDWIDE 
NETWORKING.” 


decisions based entirely on performance. 

The second big development is opening up 
Poland’s R&D infrastructure to make it more 
international. That entails attracting students 
from abroad, changing the law to make it easier 
to appoint foreign nationals to academic posts 
and making the appointment process more 
open to competition. Thirdly, the government 
is trying to forge links between science and busi- 
ness, with a new law being passed that offers 
firms tax breaks to invest in R&D. The private 
sector currently accounts for about a third of 
Poland’s R&D effort, says Zylicz. “If we can 
achieve 50:50, this will be a big achievement.” 
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Countries’ weighted fractional count (WFC) 
The combined publications from Russia and Poland in the 
Index made up 53% of the region's output. 
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According to Eurostat, the EU’s statistical 
office, Poland’s 2013 R&D spend remained 
static at US$3.66 billion, which represents a 
slight fall as a percentage of GDP from 0.9% in 
2012 to 0.87% in 2013, one of the lowest in the 
EU. The government is, however, committed to 
raising this to 2% of GDP by 2020, says Zylicz. 
This means it will have to boost spending by 
between 10 and 15% per year; the increase for 
2015 will be 10%. “We are on a good track,’ says 
Zylicz. Funding is also being directed to help 
Polish scientists compile grant applications for 
Horizon 2020, the EU’s research and innova- 
tion programme that will offer nearly US$85 
billion of funding between 2014 and 2020. 


CHECKING FUNDING SHORTFALLS 

Third in the region, and 27th globally is the 
Czech Republic, whose 2014 WFC was 6.7% 
higher than in 2013. Like several of its neigh- 
bours, the Czech Republic’s output is domi- 
nated by physics and chemistry — with 38.1% 
and 48.2% of its WFC — and these results 
reflect the country’s historical strengths in 
these fields. 

In 2013, the Czech Republic spent US$3.2 
billion on R&D, or 1.91% of its GDP, a slight 
increase over the 1.88% of 2012. This reflects 
a general stagnation that has taken place since 
2010. The period between 2000 and 2009 
was a golden era for Czech science, when the 
government increased funding by about 10% 


Relative subject area distribution 


Research in the physical sciences is on the forefront of this 


region's top countries.* 


Life sciences 


a 


’ 
’ 
’ 
iG 
1 
1 


HB Russia 
Mi Poland 
) Czech Republic 
Greece 
Mi Hungary 
Other 


Physical sciences 


“Each slice represents the proportion each subject area contributes to a country's 
overall WFC. Subject areas can overlap, so the total percentage may exceed 100%. 


S18 | NATURE INDEX 2015 | GLOBAL 
© 2015 Macmillan Publishers Limited. All rights reserved 


each year, says Vaclav Hofejsi, director of the 
Institute of Molecular Genetics in Prague, and 
one of five scientists advising the Czech Prime 
Minister on science and higher education pol- 
icy. Since then, although the government has 
maintained an annual spend of about US$1.07 
billion, this represents a decrease in real terms. 
Worst affected is the Czech Academy of Sci- 
ences, whose funding has fallen by some 10% 
from its 2009 level, says Hofejéi. 

The government-funding shortfall is cur- 
rently being offset by money from EU regional 
development funds. From 2007 to 2015, 
US$2.13 billion has been invested, mainly in 
building new research facilities. These include 
six European centres of excellence and 40 
regional centres for research and develop- 
ment. One example is BIOCEV, the Biotechnol- 
ogy and Biomedicine Centre of the Academy 
of Sciences and Charles University in Vestec. 
Set to open in September 2015, it will house 
400 researchers and five state-of the-art core 
facilities. “This is a large-scale activity that will 
probably greatly change the basic landscape of 
Czech science,’ says Hofejsi. The future support 
and sustainability of these new research infra- 
structures remains under discussion. 


SMALL BUT STRONG 

Estonia, with only 1.3 million people, punches 
above its weight in science. Its WFC for 2014 
was 22, an increase of 68% compared with 
2013. This is one of the largest WFC increases 
in the region. 

In 2014, Estonia led the region in the pro- 
portion ofits output published in Nature and 
Science — 5% of its WFC, compared to the 
global aggregate of 3.1%. This included a study 
in Nature about interacting genes in humans. 
Russia had more papers in these journals (33), 
but that represented only 1.2% of its WFC. 
Estonia published 12 articles in Nature and 
Science in 2014, the same number as its larger 
neighbours Poland (12 papers, 0.6% WFC) and 
Hungary (12 papers, 2.0% WFC). 

Estonia is often held up as an example of how 


Cryogenic facilities at BIOCEV in the Czech Republic 
provide cell-line storage for biomedical research. 


shrewd planning, tough reform and a robust 
R&D political framework can transform a 
country’s science base. In the early 1990s, 
Estonia rearranged its whole research system 
according to the western model, with its new 
funding system emphasizing research excel- 
lence and competition. In 2001, the country 
set up a Centres of Excellence in Research pro- 
gramme to promote productivity and interna- 
tional competitiveness, which established 22 
centres over successive phases. 

The government continues to prioritize 
R&D funding, says Indrek Reimand, Deputy 
Secretary for Higher Education and Research at 
Estonias Ministry of Education and Research. 
In 2013, Estonia spent 1.73% of its GDP on 
R&D. While this is lower than the EU average 
of 2.2%, government R&D spending tripled 
between 2001 and 2007, and then doubled 
between 2007 and 2013, making its R&D 
spending growth one of the fastest in the world, 
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says Reimand. Over the past decade, Estonia 
has also invested heavily in its science infra- 
structure, refurbishing labs and buildings, and 
between 2007 and 2013 invested some US$426 
million of EU structural funds in R&D capac- 
ity. A quarter of the funding was earmarked for 
human resource development, such as doctoral 
training and schemes to promote transnational 
mobility for researchers and students. 


INSTITUTIONS MOVING UP 

For individual institutions, the Nature Index 
reveals that scientists at the University of Tartu 
coauthored 10 of Estonia's 12 Nature and Sci- 
ence papers. Several of these involved large, 
international genomics projects. This reflects 
the university's policy of developing collabo- 
ration partners to create an internationally 
competitive research infrastructure in genom- 
ics and biotechnology, says Marco Kirm, vice 
rector for Research at the University of Tartu. A 
cornerstone of this effort is the Geenivaramu, 
the Estonian Biobank, which is a comprehen- 
sive collection of genetic and health informa- 
tion covering 5% of the country’s population. 
It draws collaborators from around the world, 
such as those involved in identifying genes 
associated with common diseases. “Universi- 
ties cannot cover all needs — to have people 
with advanced knowledge — without world- 
wide networking,’ Kirm says. 

The Academy of Sciences of the Czech 
Republic, which incorporates 54 research 
institutes, comes third in the region (after 
the Russian Academy of Science and the Pol- 
ish Academy of Science). Its output is heavily 
skewed toward chemistry (47.6% WFC) and 
physics (40.2%). Among this region's top insti- 
tutions, the Hungarian Academy of Sciences 
puts most emphasis on the life sciences, which 
accounts for 36.4% of it WFC. 

Some commentators see the region catching 
up with world leaders. Hosejsi says, “I think 
some of our best research institutions are 
now fully comparable to the good ones in 
Western Europe.’ = 


Top 5 institutions’ relative subject area distribution 
Despite the region's predominant focus on the physical sciences, the Hungarian Academy of 
Sciences kept its eye on the life sciences, devoting 36% of its overall output to this field* 
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Top 5 institutions’ collaborativeness 
Although Russia's Lomonosov Moscow State University is only 
the region's fourth highest producer, it collaborates the most.* 


100 
BO... 
TOree 
60 ---.- 
50 - 

AO... 


Collaborations (%) 


“Each bar represents the proportion of an institution's overall output in the 
Index (AC) stemming from domestic and international collaborations. 
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Australasia & Pacific Islands 


Research is dominated by heavyweights Australia and 
New Zealand but shifts in science policy on both sides of the 
Tasman Sea are creating uncertainty about the future. 


ARTICLE COUNT (AC): 2,705 
FRACTIONAL COUNT (FC): 1,249 
WEIGHTED FRACTIONAL COUNT (WFC): 1,048 


his vast region dotted with small, 
| developing island states that stretch 
from the Indian Ocean to Hawaii 
and down to Antarctica is dominated by the 
giant landmass of Australia alongside New 
Zealand. With great variation in landscapes, 
strong agricultural industries and increas- 
ing vulnerability to climate change, these 
countries have naturally shown a particular 
focus on earth and environmental sciences. 
At the same time, life sciences are also an 
area of strength, well served by government 
support from Australia and New Zealand, 
and increasingly a field in which there is cross- 
border collaboration. 

With the largest populations, economies 
and scientific communities, Australia and New 
Zealand are the region's ranking leaders in the 
Nature Index, at 12 and 31 in the world, based 
on weighted fractional count (WFC) for 2014. 
Meanwhile, among their smaller island neigh- 
bours, where science education and infra- 
structure remains under-developed, Papua 
New Guinea has dropped from 91 to 108, Fiji 
sits at 123, and Tonga at 137. More data will be 
required to analyse any temporal trends. 

Across the region there is ongoing concern 
about policy changes shifting the empha- 
sis towards input for outcome — allocating 
research funding on the basis of anticipated 
impact. 

The ramifications are particularly great for 


young researchers who some fear are being 
compelled to choose whether to devote them- 
selves to applied or blue-sky research too early 
in their careers. 


SUCCESS IN SPECIALIZATION 

Australia is the region’s best Index performer, 
achieving small increases in both its total arti- 
cle count (AC), up 3% from 2013, and in its 
weighted fractional count (WFC), up 2%. In 
contrast, the region's WFC decreased by 1.6% 
from 2013 data in the Index. 


“WE ARE ABOUT TO TAKE 
EVERYTHING WE HAVE 
BUILT UP OVER A DECADE 
AND CRUSH IT.” 


This improvement came during a year of 
uncertainty over science policy, for much of 
which the country did not have a science min- 
ister, and government spending on research and 
development fell to its lowest level as a propor- 
tion of total spending since the early 1980s. 

After an outcry and name-calling — includ- 
ing scientists being branded “precious petals” 
by one minister — the portfolio of minister for 
science was reinstated days before Christmas 
and a new Commonwealth Science Council, 
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Countries weighted fractional count (WFC) 
Australia produced 91% of this region's Index output, and 
New Zealand provided nearly all of the rest. 
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comprising leaders from science academia and 
industry, was formed to advise the government 
on science and technology. However, funding 
for major science infrastructure — including 
the world-class Australian Synchrotron relied 
upon for important Australian and New Zea- 
land research in fields as diverse as biomedi- 
cine and defence — was kept on hold until well 
into 2015. “We are about to take everything we 
have built up over a decade and crush it,” says 
Australian National University’s Nobel Prize 
winning astronomer Brian Schmidt of the 
shortfalls for the country’s science. As recently 
as February 2014, Schmidt had contributed 
to a paper published in Nature identifying the 
oldest-known star in the galaxy — a discovery 
made possible only through the infrastructure- 
funding scheme. After strong ongoing pro- 
test the scheme was extended in the 2015/16 
budget. To the dismay of many, however, the 
additional money to extend the scheme was 
taken from higher education research fund- 
ing and is likely to affect the capacity for many 
university based scientists to continue using the 
facilities. 

In this Index, all but one of the region's top 
institutions are Australian, but there has been 
some movement in their rankings. The Uni- 
versity of Queensland has maintained its lead- 
ing position with a WEC of 109, followed by 
Monash University (103), the University of New 
South Wales (UNSW; 88) and the Australian 


Relative subject area distribution 


Tonga's seemingly strong focus in earth and environmental sciences 
and life sciences came from one article attributed to both categories.* 
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National University (ANU; 87). Looking at the 
total number of publications from each institu- 
tion, the University of Sydney has slipped from 
the top spot it held in 2013 Index data, with the 
ANU achieving 498, to the University of Syd- 
ney’s 492 and the University of Melbourne's 478. 
The country’s government research organiza- 
tion, the CSIRO, was the only non-university 
to reach the region’s top 10. 

Australia’s strongest field for publication out- 
put is the life sciences, where the country ranks 
9th globally. Among the country’s institutions, 
the University of Queensland and University 
of Melbourne lead strongly in life sciences 
publications. Their focus is an example of the 
emerging specialization of the leading institu- 
tions in response to increasingly competitive 
funding regimes. 

UNSW, by contrast, has established itself as 
a force in the physical sciences, which account 
for 46% of the university's publications in the 
Index. In particular, UNSW has established 
itself as a hub for quantum computing. “When 
you have that strength, you can then recruit 
the best people in the world to come here,’ says 
dean of science, Merlin Crossley. 

In chemistry, Australias Monash University 
produced the highest 2014 WFC in the region, 
and it also ranked 100th among institutions 
globally for total WFC. 

Crossley expresses apprehension about the 
direction that research and funding are tak- 
ing — in particular threatened changes to 
the country’s strong fellowship programmes. 
There are good government grant provisions 
for senior researchers, he says. “But for junior 
researchers, I think it is very tough.” 


GHOST OF POLICIES PAST 

Some of the challenges facing Australian 
research — uncertain funding and a loss of 
priority for government — echo those that 
affected science in New Zealand between 2010 
and 2012. For this period, a ministry of science 
was deemed unnecessary, funding rounds were 
put on hold for 12 months, and changes were 


Electron beam lithography plays a part in 
developing quantum computers at the University 
of New South Wales. 


made to grants and funding processes that 
benefitted senior rather than junior research- 
ers. Some scientists attribute New Zealand's fall 
in the ranking to these shifts. “The kinds of 
changes that you see are probably not relatable 
to changes that happened in the last 12 months. 
We're looking at the impact of the changes in 
our science sector that happened about five 
years ago,’ says president of the New Zealand 
Association of Scientists, Nicola Gaston. 

New Zealand’s WEC fell 19% from the 2013 
level, pushing it from 28 to 31 in the global 
rankings. Gaston says that while government 
funding for New Zealand science has been 
increasing, the recent results suggest that the 
emphasis on linking input with outcomes and 
industry partnership undermines the pursuit 
of scientific excellence. “There were a lot of 
concerning things in the last five years and 
we are now seeing the consequences of those 
changes,’ Gaston says. The leading New Zea- 
land institution in the Index, the University of 
Otago in Dunedin, is among the region's top 
10 institutions and has moved up 30 places (to 
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406) in the global top 500. After Otago comes 
the University of Auckland (481), Victoria 
University of Wellington (687) and the Uni- 
versity of Canterbury (890). 

Thanks to strong funding from government 
and industry, New Zealand's greatest research 
strength is in the earth and environmental sci- 
ences, where the country ranks 16th globally; 
these fields account for 37% of the country’s 
WEC. One of the most widely discussed recent 
pieces of New Zealand-led Antarctic research 
is an investigation of the effects of waves on sea 
ice published in Nature in May 2014. Led by 
hydrodynamics scientist Alison Kohout at the 
National Institute of Water and Atmospher- 
ics, the researchers found that the impact of 
large waves extends further into an ice shelf 
than previously thought, potentially affecting 
stability hundreds of kilometres from the ice 
edge. This implies that storms and the waves 
generated therein may play an important role 
in shaping the extent of sea-ice. 


BIG STRIDES FOR SMALL COUNTRIES 

Beyond Australia and New Zealand the num- 
bers for research publications are small, but 
with a similar focus on life sciences and earth 
and environmental sciences. Among the stand- 
outs, the government-affiliated Papua New 
Guinea (PNG) Institute of Medical Research 
contributed to two of the six Index publications 
linked to PNG. One paper that looked at com- 
plex genetic mechanisms behind immunity to 
malaria may lead to future treatments. 

In Fiji, the University of the South Pacific is 
the only institution whose work features in this 
year’s Index with contributions to two major 
papers in 2014 — one looking at sea salinity and 
temperature, and the other at diseased coral. 

Scientists and politicians in both Australia 
and New Zealand are making concerted efforts 
to increase collaboration with research power- 
house, China. In the face of such changeable 
fortunes for local science, these links are likely 
to provide growing support for Australasia & 
the Pacific Islands. m 


Top 5 institutions’ relative subject area distribution 
More than half of the University of Queensland's articles addressed 
issues in the life sciences.* 
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Top 5 institutions’ collaborativeness 
The University of Melbourne's scientists collaborated 
on about 80% of its articles.* 
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“Each bar represents the proportion of an institution's overall output in the 
Index (AC) stemming from domestic and international collaborations. 
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West Asia 


International projects — including an advanced particle 
accelerator and educational facilities — bolster this 
region’s focus on chemistry and physical sciences. 


ARTICLE COUNT (AC): 1,927 
FRACTIONAL COUNT (FC): 853 
WEIGHTED FRACTIONAL COUNT (WFC): 753 


" | “hi is a region divided. Ongoing con- 
flict has stopped some countries from 
sponsoring the development of science 

or maintaining safe facilities for researchers — 
but not all. A number of nations, notably Israel 
and Turkey, have made inroads into supporting 
growth of their scientific communities or are 
adopting plans to do so. 

Israel remains the region's leading scientific 
country and the only nation in the Index to 
break into the global top 20 based on weighted 
fractional count (WFC). Israel’s WFC of 492 was 
significantly higher than both Turkey, second in 
the region with 79, and Saudi Arabia at 76. This 
success has been driven in part by high spending 
levels on R&D, which in Israel is 3.9% of GDP — 
second only to South Korea, according to 2014 
figures by United Nations Educational, Scientific 
and Cultural Organization (UNESCO). 

Israel's R&D spending is even driving high- 
tech work. In 2014, for example, an international 
team — led by Israeli scientists — reported in 
Nature Nanotechnology on a new DNA-based 
structure that might be used in electronics. 
Although this work remains in its early stages, 
the authors concluded: “Our results pave a way 
towards the implementation of DNA-based pro- 
grammable circuits for molecular electronics.” 

Israel's Weizmann Institute is the top institu- 
tion in the region with a WFC of 143, slightly 
below its 2013 figure. It has a relatively evenly 
spread publication count across disciplines 


— producing the highest outputs in life sciences 
and physical sciences, followed by chemistry. 
Globally this institute is 64th based on WFC, and 
it is 21st on the Index’s list of institutions around 
the world for WFC in Nature and Science. 
However, despite these impressive statistics, 
research at Weizmann — and indeed within 
the country as a whole — is lacking in one key 
area. “While Israel invests significantly in R&D, 
particularly in information and communication 
technologies, the investments in biotech might 
be lagging behind,” says Michal Neeman, vice 


“YOU CAN'T JUST IMPORT 
RESEARCH BY HIRING A 
FEW EXPATRIATES WHO 

ARE WORLD RENOWNED.” 


president of the Weizmann Institute. The insti- 
tute is working on filling in these gaps with its 
recently established Nancy and Stephen Grand 
Israel National Centre for Personalized Medi- 
cine. Created in 2013 with US$150 million in 
donations, according to Neeman, the centre 
aims to lead Israel's research in genomics, protein 
profiling, bioinformatics and drug discovery. Its 
steering committee comprises leaders from sev- 
eral universities in Israel, which helps facilitate 
research collaborations across institutions. 
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Region-leader Israel contributed 65% of the output, and Turkey, 


Saudi Arabia and Iran added around 10% each. 
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Overall, Weizmann lags behind the region in 
collaboration. Whilst around half of its articles are 
the result of collaborative efforts, other institu- 
tions in the region are even more prolific in estab- 
lishing research networks to drive publications. 


BIG SPENDERS STILL COLLABORATE 

Other countries in the region are also making 
significant R&D investments. Turkey's invest- 
ment has grown every year since 1995, and is 
now just under 1% of GDP. Since 2000, Turkey 
also has developed national science, technology 
and innovation policies, including the establish- 
ment of the Turkish Research Area (TARAL) 
in 2004. This provides a forum for the coun- 
try’s stakeholders in research and innovation, 
including public agencies, universities, non- 
governmental organizations and the private sec- 
tor. Such investments and policies appear to be 
paying off: the number of researchers employed 
and papers published are both on the rise. In this 
Index, Turkey ranked 36, having overtaken Iran 
and Saudi Arabia. 

To further extend Turkey’s science capabili- 
ties, the government encourages international 
collaboration — to this end the Scientific and 
Technological Research Council of Turkey 
(TUBITAK) provides fellowships for foreign 
PhD students and postdoctoral scientists. 

Strong research output requires more than 
big investments. Saudi Arabia has poured bil- 
lions into new universities, particularly with 


Relative subject area distribution 


articles in the physical sciences.* 


Earth and environmental 


Four of the top five countries published at least 40% of their Index 


Hi israel 
Turkey 

9) Saudi Arabia 
Wilran 


Mi United Arab 
Emirates 


Other 


Physical sciences 


“Each slice represents the proportion each subject area contributes to a country's 
overall WFC. Subject areas can overlap, so the total percentage may exceed 100%. 


© 2015 Macmillan Publishers Limited. All rights reserved 


the 2009 opening of King Abdullah University 
of Science and Technology (KAUST), which is 
believed to have a US$15 billion endowment, 
and has attracted top talent for research. From 
2012 KAUST has increased its article output 
in the Nature Index by 40%. Given its more 
moderate increase in WFC over the same time 
period, it seems that KAUST researchers are 
actively contributing more to collaborative 
papers than working in isolation. 

Performance in the Nature Index is also 
greatly dependent on collaboration — and this is 
where the Arab states are doing well. Saudi Ara- 
bia reaches out extensively to other countries for 
scientific collaboration, with 79% of its output 
being the result of international collaborations. 
Israel and Turkey collaborated internationally 
on 46% and 59% of their papers, respectively, 
showing that scientists in Saudi Arabia collabo- 
rate much more than their colleagues in neigh- 
bouring countries. This high collaboration rate, 
however, may be due in part to the practice at 
some universities — although not KAUST, 
which opposes such an approach — of bringing 
in foreign researchers for a short amount of time 
to add a local affiliation to papers. 

Even higher international collaboration rates 
apply to other wealthy Gulf countries that have 
invested in higher education, such as the United 
Arab Emirates (UAE) and Qatar. Both countries 
host campuses for foreign institutions, includ- 
ing New York University, Weill Cornell Medical 
College, Northwestern University and Paris- 
Sorbonne. The UAE’s AC is 4.3 times its FC, 
while Qatar's is 5.8 times larger, supported by 
significant scientific collaborations. 

Collaborations can help stimulate organic 
improvement of science by educating home- 
grown researchers. Leaders in the region 
understand the value of educating a scientific 
workforce. “Building [an] education system and 
stimulating research must be entrenched within 
society,’ says Khaled Toukan, the chairman of 
the Jordan Atomic Energy Commission and the 
country’s former minister for Higher Education 
and Scientific Research. “You can't just import 


These brown magnets will focus the electron beam 
in the SESAME particle accelerator in Jordan. 


it by hiring a few expatriates who are world 
renowned and they sit and publish papers.” 

Jordan lacks the bountiful energy resources 
enjoyed by other countries in the region includ- 
ing Saudi Arabia, the UAE and Qatar, which 
allow them to super-charge their research budg- 
ets. Instead, the country — which is yet to break 
into the top 100 — is having to slowly build its 
scientific community from the bottom up, a pro- 
cess that will probably require working with other 
countries. To build its first reactor for nuclear 
research, for example, Jordan’s Board of Com- 
missioners of the Energy and Mineral Resources 
Regulatory Commission sought the help of the 
Korea Institute of Nuclear Safety. Korean experts 
will help to build and run the facility, expected to 
be operational in 2016 at the Jordan University of 
Science and Technology in Irbid. 


ACCELERATING INTERNATIONAL SCIENCE 
Two big projects in Jordan should help improve 
national and international collaboration, Toukan 
says. Two 1,000-megawatt nuclear reactors are to 
be constructed in the northern part of the coun- 
try through a US$10 billion partnership with 
Russia, and the Synchrotron-light for Experi- 
mental Science and Applications in the Middle 
East (SESAME) in Allan, Jordan will be the first 
particle accelerator in the region. 

SESAME '’s leaders hoped to get the facil- 
ity online by the end of 2015. But a severe 
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snowstorm in December 2013 caused the roof 
of the experimental hall at SESAME to cave in, 
which set the project back by a year. A new roof 
has been built and construction has resumed. 
The plan now is to have two beam lines ready 
by the end of 2016, one for hard X-rays — those 
with the highest energy — and one for infrared 
spectromicroscopy. With these lines, researchers 
hope to investigate, for example, cells and viruses 
and make breakthroughs in medicine. 

The SESAME site certainly won't be among 
the largest particle accelerators in the world. Its 
storage ring is only 133 metres in circumfer- 
ence — compared to 1,500 metres for SPring 8 
in Japan — but it will be able to accommodate 
several experiments at a time when all of its 25 
beam lines are functioning. 

SESAME is an ambitious project on many 
levels. Launched by UNESCO, with oversight 
from CERN, it is receiving input and support 
from some of the top scientific leaders in coun- 
tries including Germany, France and Italy. 
But its primary aim is to serve researchers in 
the region who come from areas with deeply 
divided politics, including Egypt, Iran, Israel, 
Palestine, Turkey, Bahrain, Pakistan and Jordan. 

“This is a positive [move], where rather than 
destroying each other, people are working 
together to promote science for peace, harmony 
and positive contribution to the modern world” 
says Toukan. 

The hope is that SESAME can help alleviate 
persistent drawbacks for the region, includ- 
ing brain-drain, a lack of collaboration among 
researchers and low numbers of published 
papers. The laboratory could become a home 
for researchers considering going abroad or it 
might even lure back scientists who have already 
left. And with all the applications of synchro- 
trons, it could bring together researchers who 
would have otherwise never interacted. 

“To survive in a global market, you really 
have to build industry,’ Toukan says. “And the 
only way to do it is through advanced scientific 
research and development, which the countries 
in this region realize now.’ = 


Top 5 institutions’ relative subject area distribution 
Israel's Weizmann Institute of Science led the region, and it published 
about equally in life and physical sciences.* 
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Top 5 institutions’ collaborativeness 

Among the top five, the Weizmann Institute of Science 
published the most but collaborated the least.* 
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Central & South Asia 


India’s dominance continues in this region, but its future 
funding is tenuous, while neighbouring smaller countries 
face more fundamental obstacles to scientific success. 


ARTICLE COUNT (AC): 1,574 
FRACTIONAL COUNT (FC): 1,057 
WEIGHTED FRACTIONAL COUNT (WFC): 944 


he countries of Central & South Asia 
| have spent the last decade building 
their science profiles by expanding 
their community of researchers. Nonetheless, 
this region finishes 6th out of 9 in this Nature 
Index, based on its 2014 weighted fractional 
count (WFC) of 944. Most of the region's publi- 
cations in the Index focus heavily on chemistry 
or physical sciences (47.7% and 40.9% of the 
region's total WFC, respectively), leaving little 
capacity for work in life and earth and environ- 
mental sciences. 

Looking at the reasons, it seems that trends 
in researcher mobility may play a role in keep- 
ing the region from producing more work 
published in the top journals. Brain drain, for 
instance, is a big problem for countries in the 
region — an exodus of top talent that benefits 
countries at the top of the scientific pyramid 
leaves other nations bereft of leading research- 
ers. Gail Joseph, a researcher at North Carolina 
Agriculture and Technology State University, 
says, “I see so many Indians here [in the US] 
who are excellent researchers. They feel more 
appreciated here, plus the infrastructure and 
technology [are] much more advanced abroad.” 

While the lure of international institutions is 
a factor, scientists in the region do not regularly 
team up with researchers from other countries. 
With 33% of articles arising from international 
collaborative efforts, scientists in Central & 
South Asia collaborate on fewer projects than 


the regional aggregate of the WFC for the 
Index (46%). When researchers in this region 
do collaborate internationally, it is with scien- 
tists from North & West Europe (44% of inter- 
national collaborations) and North America 
(30%). “The trend is for countries in the region 
to work more with the West than building 
regional networks,” says Ravi Silva, director 
of the Advanced Technology Institute at the 
University of Surrey in the United Kingdom. 


“| SEE SO MANY INDIAN 
RESEARCHERS WHO 
FEEL MORE APPRECIATED 
IN THE UNITED STATES.” 


The lack of collaborations within the region 
could be explained by one local philosophy. 
“Our emphasis has been growing the talent in 
India,’ says physicist Ajay Sood at the Indian 
Institute of Science (IISc) in Bengaluru. “We 
have a very low percentage of people com- 
ing from other countries, almost negligible.” 
Nonetheless, India performs reasonably well 
on the global level. 


INDIA’S DOMINANCE 
India ranks 13th globally in terms of WFC and 
is by far this region's leader in science in the 
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The most dominant leader in any region, India produced 98% 


of the Index articles from Central & South Asia. 
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Index, contributing 97.6% of Central & South 
Asias WFC in 2014. For instance, India’s 2014 
WEC includes an article in Nature Genetics on 
heart disease in children, which reveals pos- 
sible targets for therapies related to mutations 
in the RAFI gene. 

Despite such clearly valuable research and 
India’s WFC increasing by 8.3% from 2013 
to 2014, some experts question the impact of 
India’s publications. 

Improving the quality of India’s published 
papers will require more spending on research. 
Funding is divided into national laboratories 
that receive block grants and universities that 
compete for grants. Since there are more sci- 
entists in colleges and universities, they are 
competing for funds and working with limited 
resources. Moreover, Indian National Science 
Academy vice president S. C. Lakhotia calls the 
funding competition “lengthy and painful for 
active researchers.” 

Consequently, the universities face chal- 
lenges in attracting good scientists. Graduate 
students and young scientists are drawn more 
to research institutions than to universities, says 
Lakhotia. This trend, he says, leads to “spiraling 
inequity in quality of resources and manpower.’ 
“Nearly all of the traditional teaching institu- 
tions suffer from lack of a critical mass of qual- 
ity researchers and teachers.” 

Despite this precarious state of funding, 
Sood’s IISc has top quality equipment, and it 
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has attracted good researchers. Its WFC of 94 
places it third among institutions in the region 
and 118th in the world. Other top research 
organizations include the Indian Institutes of 
Technology (IITs), which are spread across the 
country and which, in aggregate, ranks 51st in 
the world and Ist in the region with a WFC of 
168. Reflecting India’s focus on chemistry, IITs is 
one of the top institutions in the Index for chem- 
istry worldwide, ranking 24th with a WFC of 107 
— up 43% compared to its WFC in 2013. 

Similar to the IITs, India’s Council of Scientific 
and Industrial Research (CSIR), which ranks 
72nd in the world with a WFC of 129, has doz- 
ens of labs and centres across India. It, too, is a 
key player in chemistry, being one of the top 30 
institutions in the Index for this discipline with 
a WEC of 90. Unlike the IITs, CSIR falls into 
the category of national lab, which guarantees 
block grants from the government. 


ABOVE AND BEYOND 

Looking only at international journals and 
funding doesn't tell the whole story, however. 
The success of India’s Mars Orbiter Mission 
— which entered the Mars atmosphere in Sep- 
tember and was recently reported to be so fuel 
efficient that the mission will last six months 
longer than originally planned — has beena 
major point of pride throughout the scientific 
community and the country in general. “It is a 
tremendous achievement at very low cost and 
successful at its very first attempt,’ says Ven- 
katesh Narayanamurti, dean of Harvard Uni- 
versity’s School of Engineering and Applied 
Sciences. He calls the mission India’s “single 
biggest accomplishment.” Narayanamurti also 
highlighted another scientific accomplishment 
for India that will have a strong impact on tech- 
nological capabilities: the development of the 
country’s first regional GPS system, which is 
expected to be completed in 2015. 

Despite these achievements, some research- 
ers are disappointed that funding for research 
hasn't increased as they had hoped it would 
with the new government of Prime Minister 


This launch put India’s Mars Orbiter in the 
planet’s atmosphere in September 2014. 


Narendra Modi, who took office in 2014. Fac- 
ing a slowdown in the economy, the Modi 
administration has failed in two budgets to 
boost R&D spending above 1% of the GDP. The 
funding increase of 4% was below- inflation and 
not enough to satisfy researchers. 

“There is an apprehension in the Indian sci- 
entific community that from this year onward 
the past increase in funding will not continue 
and may actually turn into severe cuts for the 
basic sciences,” says Kavita Dorai, a physicist 
at the Indian Institute of Science Education & 
Research Mohali in Punjab. “If this happens, it 
will have far-reaching consequences for both 
the scientific community and for the nation” 


LAGGING BEHIND 
The region's clear runner-up is Pakistan, which 
ranks 53rd in the world with a WFC of 13, 
down 29% compared to 2013. It has a sizable 
community of researchers approaching 53,000 
— or about 295 researchers per million citizens 
— but it has yet to get a single university into 
the Index’s top 10 institutions of the region. 
Similar to its neighbours, Pakistan’s publica- 
tion output leans heavily toward the physical 
sciences. 

Even though Pakistan lags behind India in 
scholarly output, it has had some notable suc- 
cesses. In mid-December, Pakistan celebrated 
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the country’s admittance as an associate mem- 
ber of CERN, the European Organization for 
Nuclear Research — a status not yet attained 
by regional rival India. 

Pakistan’s government is also moving for- 
ward with its initiative to close the gap between 
academia and industry in an attempt to boost 
the country’s economy. The government has 
now supported the opening of 38 Offices of 
Research, Innovation and Commercializa- 
tion (ORICs) in universities around the coun- 
try since the initiative began in 2011. Aliya 
Rehman, director of the University of Kara- 
chi’s ORIC, notes there is some resistance to 
these collaborations from both industry and 
academia. She recently conducted a survey 
that found that while hundreds of patents have 
been awarded to researchers at the University 
of Karachi, none of them have yet been com- 
mercialised. Rehman says this is due to lack of 
initiative and roadblocks in the process. “They 
do their research and it just goes in their cup- 
boards,’ she said. One promising collaboration 
she is working on will link research being done 
on Pakistani seaweed with a company that has 
been manufacturing herbal medicine and cos- 
metics using imported seaweed. 

Still, Pakistani researchers face many bar- 
riers that are causing many of the brightest to 
seek opportunities elsewhere, says Rehman. 
“You have no idea how difficult it is to do 
research in Pakistan.” She points to the fund- 
ing mechanism in particular. “The funding 
agencies are very slow to process the project 
applications and by the time the funding 
comes, the work has been done in some other 
part of the world,’ she says. 

Exacerbating the situation, some countries 
in Central & South Asia are still facing obsta- 
cles that scientists in developed nations cannot 
even imagine. In Pakistan, for example, Rehman 
points to electricity shortages and political 
instability as major impediments to research, 
sometimes closing down the universities for 
days. “It is an extremely difficult position and 
that is why a lot of people are leaving,’ she says. m 


Top 5 institutions’ relative subject area distribution 
Since the top five institutions came from India, they reflected the 
country's emphasis on chemistry.* 
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Top 5 institutions’ collaborativeness 
Compared to top institutions in other regions, Indian 
institutions are more self-contained and collaborate far less.* 
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Latin America & Caribbean Islands 


There is scope for improvement with the volume and quality 
of research in this region, but some innovative programmes 
for study abroad might improve future publications. 


ARTICLE COUNT (AC): 2,249 
FRACTIONAL COUNT (FC): 864 
WEIGHTED FRACTIONAL COUNT (WFC): 574 


razilian economist Anténio Delfim 
B Netto, one of the government officials 

credited with the country’s rapid eco- 
nomic development during the 1970s, famously 
argued: “One needs to wait for the cake to rise 
before cutting any slices.” That is, a country 
should not try to redistribute wealth with- 
out a preceding phase of vigorous economic 
growth. If one applies Delfim Netto’s maxim 
to Latin American science, it is clear that the 
cake has risen — there are almost seven times 
more papers published by Brazilian scientists 
in indexed journals, for instance, than 20 years 
ago, according to the Scimago/SCOPUS data- 
base, which started decades before the Nature 
Index — but this hasn’t made it very com- 
petitive in the Index. The growth in absolute 
numbers is still not enough to place countries 
in Latin America and the Caribbean Islands 
among the world’s leading scientific commu- 
nities, according to the Nature Index. 

Based on the Nature Index’s 2014 weighted 
fractional count (WFC), Brazil produced the 
region’s highest output and is 23rd globally with 
a 2014 WEC of 237. Behind Brazil, Argentina, 
Chile and Mexico came in 30th, 32nd and 34th 
in the global Index. Physical sciences account 
for 55.4% of the region's WFC. 

Brazil puts the most emphasis on scientific 
research of all countries in the region — spend- 
ing 1.3% of its GDP on R&D, followed by Mexico 
at 0.5% ofits GDP. Among the four top countries, 
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Chile provides the least funding for research — 
about 0.36% of its GDP (which is also by far the 
lowest of the four in absolute terms, because it 
also has the lowest GDP) — but its WFC (95) is 
barely lower than Argentina’s (99). 

In this region, public funding is much more 
important than partnerships with the private 
sector, and scientists rarely become entrepre- 
neurs. Despite this, some government promises 
for increased support have not been realized. 
For example, scientometrist Francisco Collazo- 
Reyes of Mexico's National Polytechnic Institute 
says, “The winning tickets in the last several 
presidential campaigns here have several times 
promised to raise funding to 1% of GDP, but this 
hasn't materialized so far” 


“ONE NEEDS TO 
WAIT FOR THE CAKE TO 
RISE BEFORE CUTTING 

ANY SLICES.” 


COLLABORATING ON COMMODITIES 

The pattern of international collaborations in 
Latin America resembles that of North Amer- 
ica. Both regions lean toward research part- 
ners in North & West Europe. In fact, 47% of 
Latin American and 49% of North American 
international collaborations involve scientists 
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from these areas. Latin American scientists 
also collaborate often with their neighbours 
to the north: 28% of Latin American interna- 
tional collaborative work is done with US and 
Canadian scientists. 

Much of Latin America’s economic growth in 
the past couple of decades has been fuelled by 
Chinese demand for commodities. For exam- 
ple, China imports Brazilian and Argentinean 
soybeans and Brazilian iron ore. Latin Ameri- 
can research centres, however, still seem largely 
oblivious to the potential of Asian science. Only 
4% of international collaborations by scientists 
from Latin America and the Caribbean Islands 
include colleagues from East & Southeast Asia. 

On average, though, scientists in Latin 
America and the Caribbean Islands collabo- 
rate more often internationally than scientists 
around the world. In addition, its scientists 
collaborate significantly with others in North 
America and Europe. For instance, four 
researchers from Brazil — plus others from 
France, Italy, Spain, the United States and Rus- 
sia — published a 2014 Nature Biotechnology 
article on the sequencing of citrus. The research 
itself even added to this article’s international 
flavour, since it explored the phylogeny of 
plants that started to be domesticated thou- 
sands of years ago in Southeast Asia. “Advances 
in citrus genomics should soon allow the iden- 
tification of the somatic mutations that, with 
their ancient genetic backgrounds, underlie the 
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diversity of citrus colour, flavour and aroma in 
modern cultivars,’ the authors concluded. 

But citrus is far from the only type of agri- 
cultural product that is driving Brazilian sci- 
ence. Indeed, although Brazil grows more than 
700,000 hectares of orange trees, it plants more 
than 10 times that much in sugar cane. In fact, 
Brazil is the world leader in producing sugar- 
cane ethanol, which has become an attractive 
alternative fuel for vehicles. In 2014, accord- 
ing to Bloomberg, Brazil's state-development 
bank, Banco Nacional de Desenvolvimento 
Economico e Social, invested US$843.5 mil- 
lion to increase the productivity of turning 
sugar cane into ethanol. 


SIZE MATTERS 

Publicly funded universities that, by North 
American and European standards, have huge 
numbers of students and staff, are responsi- 
ble for most of the research in Latin America 
and the Caribbean. The two Latin American 
institutions with the highest WFCs are Brazil's 
University of Sao Paulo (USP), with 100,000 
undergraduate and graduate students and 6,000 
faculty, and the National Autonomous Univer- 
sity of Mexico, with more than 300,000 students 
and about 37,000 academic staff. 

Rogerio Mugnaini, a scientometrist at USP, 
argues that institutions of such size are bound 
to perform ina relatively irregular way when it 
comes to publishing research. He says, “They 
will certainly have a number of groups in a 
position of international leadership, but also a 
very long tail of researchers that don't perform 
so well? 

Asa result, he says, the overall performance 
of these gargantuan institutions suffers. None 
of the leading institutions in Latin America is 
among the top 200 in the world based on the 
Index’s WFC. 

Research strength in Latin America tends 
to be much more geographically concentrated 
than in other regions. Brazil, for example, has 
four institutions in the region's top 10, and 
three — USP, Sao Paulo State University and 


This processing factory in Valparaiso, Brazil, 
turns sugar cane into ethanol for fuel. 


the State University of Campinas — are in Sao 
Paulo State; the other top-10 institution, the 
Federal University of Rio de Janeiro, is in the 
adjacent state of Rio de Janeiro. About half of 
Brazil's science output comes from Sao Paulo, 
although a 2014 Nature Genetics article on the 
human hookworm’s genome included a Brazil- 
ian author from Minas Gerais. 

Both Chile and Argentina have two institu- 
tions in the regions’ top 10, and the University 
of Puerto Rico, the sole representative of this 
US territory, is also among the top 10. 

For Chile, says Atilio Bustos-Gonzalez of 
the Scimago Research Group at Adolfo Ibafez 
University in Vifia del Mar, Chile, one of the 
main challenges is how to increase the output 
of quality science in research centres outside 
of Santiago. 


REACHING FOR RECRUITMENT 

To increase the international relevance of Latin 
American science, governments are creating 
programmes to fund scholarships overseas. 
The most ambitious and controversial so far 
is Brazil’s Science Without Borders, which 
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Top 5 institutions’ relative subject area distribution 
In a regional largely focused on physical sciences, the National Autonomous 
University of Mexico generated more than 40% of it Index articles in the life sciences.* 
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*Each slice represents the proportion each subject area contributes to an institution's 
overall WFC. Subject areas can overlap, so the total percentage may exceed 100%. 
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started in 2011. Its initial aim was to send about 
100,000 young scholars — mostly undergradu- 
ates, plus 15,000 doctoral candidates and 6,500 
post-docs — to institutions abroad. President 
Dilma Rousseff pledged to double that number 
during her re-election campaign in 2014. 

The initiative has been criticized for lax 
selection criteria and implementation. For 
example, some undergraduates have been 
accepted despite having little command of the 
language of the country to which they were 
headed, and there are no clear rules to monitor 
their activities abroad. 

The ease with which some students have 
managed to spend most of their time travel- 
ling has led critics to call the initiative “Tour- 
ism Without Borders.” 

“The focus on undergraduates has attracted 
a lot of scepticism about the long-term effects 
of the programme in Brazilian science,’ says 
Mugnaini. 

Ona more modest scale, the Chilean Min- 
istry of Education has since 2008 awarded 
Becas Chile (Chile Scholarships) for masters 
students, doctoral candidates and post-docs. 

Those who receive the scholarships sign an 
agreement to return to Chile after the end of 
their fellowship. “The first PhDs of Becas Chile 
are starting to return, and that is bound to gen- 
erate an increase in the country’s scientific out- 
put,’ predicts Bustos-Gonzalez. 

The hope is that programmes like Science 
Without Borders and Becas Chile will give 
Latin American researchers a chance to inter- 
act with scientists from countries with higher 
WFCs and attract foreign scientists to Latin 
America. 

The Brazilian programme, for example, is 
offering 2,000 fellowships for “special visiting 
researchers” who are willing to spend at least 
a month each year in Brazil for a three-year 
period. 

Increasing the cooperation with first-world 
scientists might improve quality in publica- 
tions from Latin America & Caribbean 
Islands. = 


Top 5 institutions’ collaborativeness 
Four of the top five institutions used international collabora- 
tors on more than 70% of their 2014 Index articles.* 
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“Each bar represents the proportion of an institution's overall output in the 
Index (AC) stemming from domestic and international collaborations. 
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Africa 


Public health demands might drive advances in 
research, but scientists, working against the odds, 
must also strive to publish in globaljournals. 


ARTICLE COUNT (AC): 651 
FRACTIONAL COUNT (FC): 172 
WEIGHTED FRACTIONAL COUNT (WFC): 128 


behind the rest of the world, and economic 

factors explain some of its shortcomings. 
Although Africa is home to nearly 15% of the 
world’s population, this region only produced 
0.25% of the 2014 weighted fractional count 
(WEC) in the Nature Index. South Africa, 
which accounted for 64% of the region's 2014 
WEG, spent only US$2.8 billion on R&D 
(0.76% of its GDP). By comparison, the United 
States spent US$466 billion (2.70% of GDP), 
and generated a WFC of 17,937. Africa’s poor 
publishing record in the Index is down to more 
than economics. 

In many cases, African scientists publish 
in journals that are not counted by the Index. 
“Many of the scientists we work with in Africa 
are publishing quite often, but most often their 
work is being published in much more local, 
regional or continental journals or ones with 
very specific topic areas,’ says Nina Dudnik, 
founder of the US-based nonprofit, Seeding 
Labs, which provides used equipment and 
training to scientists in developing countries in 
Africa and elsewhere. She cites one colleague 
in Ghana with 73 publications — virtually all 
of them in narrowly focused or regional publi- 
cations that are not in the Index, such as Food 
Research International and the Journal of the 
Ghana Science Association. “Often scientists do 
not know the right way to pitch their research 
to a major journal,’ says Dudnik. 


IE scientific publishing, Africa trails far 


Most of Africa barely registers on the Index. 
Behind South Africa, Egypt contributes 11% 
of the region’s 2014 WFC. Kenya, Algeria and 
Tunisia provide another 4%, 3.5% and 2%. Nine 
out of ten of Africa's top institutions by WFC are 
in South Africa, with the only exception being 
the Mpala Research Centre in Kenya. 


“SOMETIMES IT’S 
NOT JUST LACKING 
CUTTING-EDGE 
EQUIPMENT, BUT ANY 
EQUIPMENT AT ALL.” 


PUBLIC HEALTH’S IMPACT 

While Africa gets most of its WFC from physi- 
cal sciences, efforts are under way to broaden 
the region’s research scope. In particular, 
recent public-health challenges might drive 
more research in life sciences. For example, 
the Ebola epidemic — the worst ever recorded 
— has taken more than 10,000 lives, according 
to the US Centers for Disease Control and Pre- 
vention (CDC). This outbreak spurred the One 
Health research programme at the Southern 
African Centre for Surveillance of Infectious 
Diseases to assemble a multi-national team of 
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Countries’ weighted fractional count (WFC) 
South Africa and Egypt contributed 75% of the region's 
articles in the Index. 


(Other: 18) 


TOTAL WEC: 


128 
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African researchers in Johannesburg to study 
strategies for controlling the virus, with fund- 
ing from the Wellcome Trust. In Tanzania, The 
Zanzibar Research Agenda 2015-2020 calls for 
a range of health initiatives that will require 
research, such as exploring the potential to 
apply nanomedicine — nanometer-scale tech- 
nology used in medical applications — to early 
detection of diseases, as well as diagnosis and 
follow-up on the impact of treatments. 

Africa makes some effort to publish articles 
in the life sciences, which receives the second 
highest emphasis based on it percentage of the 
region's total WEC. In Index data for 2014, only 
two regions — North America and Australasia 
& Pacific Islands — placed more emphasis on 
life sciences. 

Like other developing regions, Africa could 
drive more life sciences research through clin- 
ical trials. According to Judy Coates, scientific 
and regulatory affairs manager at the Innova- 
tive Pharmaceutical Association South Africa 
in Johannesburg, her country has experienced 
a stable annual investment in clinical trials of 
about US$200-250 million in recent years. 
A study by her organization, however, shows 
that this figure could grow to about US$400 
million if South Africa’s pharmaceutical regu- 
lator, the Medicines Control Council, could 
improve the ability for companies to run trials 
with fewer delays. 

Even with a relatively consistent investment 


Relative subject area distribution 


Although 91% of Kenya's Index articles explore life sciences, 


this came from a WFC of only 6.* 
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“Each slice represents the proportion each subject area contributes to a country's 
overall WFC. Subject areas can overlap, so the total percentage may exceed 100%. 


© 2015 Macmillan Publishers Limited. All rights reserved 


in clinical trials in South Africa, the overall 
region's share of the global market is declining. 
A recent study by US-based Quintiles found, 
for example, a 9.1% decrease in sub-Saharan 
Africa's share of the global clinical-trial mar- 
ket from 2010 to 2012. In fact, most of that fall 
was in South Africa, where the share of the 
world market for clinical trials dropped by 
7.9%. The experts at Quintiles speculated that 
sub-Saharan Africa could be losing some of the 
niche market in clinical trials, such as ones for 
malaria, which has been a driver in the past. 

Still, scientists in the region study the 
local diseases. In 2014, for instance, a scien- 
tist from The Gambia was one of the dozen 
authors on a Nature Communications article 
about artemisinin combinations for treat- 
ing malaria. Originally, this drug came from 
sweet wormwood (Artemisia annua), which is 
aherb used in traditional Chinese medicine, 
but now it can be mass-produced in genetically 
modified yeast. This study showed the value of 
doing research in Africa, because the authors 
concluded: “We find that a locally optimized 
treatment policy can be highly cost effective 
for reducing clinical malaria burden.” 

In addition, the healthcare industry can 
invest in Africa beyond clinical trials. For the 
pharmaceutical industry overall, “the compa- 
nies are more involved in Africa since it is a 
growth market, and they are using numerous 
mechanisms, including investing in research 
infrastructure as well as initiating projects,” 
says Dries Oelofse, business development 
manager at H3-D, a drug development centre 
at the University of Cape Town. For exam- 
ple, UK-based GSK funded a US$6 million 
grant for African researchers to study non- 
communicable diseases in the region. 

Despite Africa’s small output compared to 
the rest of the world — or perhaps in some 
ways because of it — the region leads a signifi- 
cant metric in the Index. In the percentage of 
aregion’s WFC published in Nature or Science, 
Africa’s 4.7% is the highest, surpassing second 
place North America (4.5%). Granted, that’s 


Prevotella bacteria in the gut of Hadza hunter- 
gatherers enhance the ability to digest plant material. 


only about six publications for Africa (versus 
870 for North America), but this is still an 
important metric. 


COLLABORATION KING 

Africa led the world in collaborations in 2014 
articles in the Index. Overall, more than 70% 
of Africa’s output comes from work done with 
scientists from other countries. This makes 
Africa the most internationally collaborative 
of all regions in the Index. On average, regions 
only collaborated internationally on 46% of 
their articles. Africa collaborated most in the 
earth and environmental sciences (85%) and 
the life sciences (80%). It collaborated less in 
the physical sciences (73%) and chemistry 
(69%). All but a few percent of those collabo- 
rators are with countries outside Africa. The 
region that African scientists most frequently 
work with is North & West Europe. This 
research bond arises in part from programmes 
that encourage collaborations between Afri- 
can scientists and those in the United King- 
dom. For example, the Leverhulme Trust in 
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the United Kingdom offers funding up to 
about US$750,000 over five years for some 
research projects. 

Even when collaborating with scientists 
outside of the region, the research can make 
use of African resources. A 2014 Nature Com- 
munications article on the gut microbiome of 
Tanzania’s Hadza hunter-gatherers included 
one author from the country, and the research 
explored how bacteria might have co-evolved 
with this human population. The results sug- 
gest that specific bacterial species and combi- 
nations impact how these people gain nutrition 
from even very fibrous plants. 

To achieve more success in international 
publishing for Africa overall, though, the sci- 
entists need modern tools. A lack of advanced 
equipment limits high-profile publishing. 
As Dudnik explains, “Researchers using 
equipment that is not the Western gold stand- 
ard have to work harder to demonstrate the 
validity of their results.” That surely reduces 
Africa’s impact. 

In 2014, for example, Dudnik wrote in The 
Wall Street Journal about the US Centers for 
Disease Control and Prevention establishing 
a laboratory in a hospital in Uganda to test 
blood samples for Ebola in 2000. “Yet 14 years 
later,” Dudnik wrote, “that lab is gone and no 
institution in the region has the equipment 
required to test for Ebola or other prevalent 
diseases including yellow fever, hepatitis, Mar- 
burg virus or HIV’ So sometimes it is not just 
lacking cutting-edge equipment, but lacking 
any useful equipment at all. 

“Ultimately, says Dudnik, “the scientific 
output of African researchers receives less 
attention, reaches a smaller audience and is 
under-catalogued by global indices.’ By look- 
ing to international publications as potential 
outlets whenever possible, scientists in this 
region could start to work their way into the 
global world of science. 

Spreading that trend across all of Africa, 
however, will take time, improved laboratory 
equipment and increased funding. = 


Top 5 institutions’ relative subject area distribution 
All of the top five institutions in the region were in South Africa, 
and they published, on average, across the range of categories.* 
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Top 5 institutions’ collaborativeness 
African scientists collaborated more than any region in 
the world, and the top five institutions followed this trend.* 
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“Each bar represents the proportion of an institution's overall output in the 
index (AC) stemming from domestic and international collaborations. 
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A guide to the Nature Index 


A description of the terminology and methodology used in this supplement, 
and a guide to the functionality available free online at natureindex.com. 


he Nature Index is a database of author 

| affiliations and institutional relation- 

ships, used to track contributions 

to articles published in a group of highly 

selective science journals that have been 

chosen by an independent group of active 
researchers. 

Data in the Nature Index are updated 
monthly, with the most recent 12 months of 
data made available under a Creative Com- 
mons license at natureindex.com. 

The Nature Index provides absolute counts 
of high-quality publication productivity at the 
institutional and national level, and as such is 
one indicator of high-quality research output 
across the globe. 

The database is compiled by Nature Pub- 
lishing Group (NPG) in collaboration with 
sister company Digital Science. 

The current journals tracked by the Nature 
Index will be reviewed again in 2015, and from 
2016, coverage will be extended to include the 
clinical sciences. 


NATURE INDEX METRICS 

There are three measures provided by the 
Nature Index to track affiliation data. The sim- 
plest is the article count (AC). A country or 
institution is given an AC of 1 for each article 
that has at least one author from that country 
or institution. This is the case whether an arti- 
cle has one or a hundred authors, and it means 
that the same article can contribute to the AC 
of multiple countries or institutions. 

To get a better sense of a country or institu- 
tion’s contribution to an article, and to remove 
the possibility of double-counting of articles, 
the Nature Index uses the fractional count (FC). 
FC takes into account the relative contribution 
of each author to an article. 

The total FC available per paper is 1, and 
this is shared between all authors under the 
assumption that each contributed equally. For 
instance, a paper with 10 authors means that 
each author receives an FC of 0.1. For authors 
who have worked with joint affiliations, the 
individual FC is then split equally between 
each affiliation. 

The third measure used is the weighted 
fractional count (WFC), which applies 

a weighting to the FC to adjust for the over- 
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NATUREINDEX.COM 


A global indicator of high-quality research 


Users of natureindex.com 
can search for specific 
institutions or countries 
and generate their own 
reports, ordered by article 
count (AC), fractional count 
(FC) or weighted fractional 
count (WFC). 

Each query will return a 
profile page that lists the 
country or institution’s 
recent research outputs, 
from which it is possible 
to drill down for more 


Institution name 


Country 


Collaboration Relationships 


1 January 2014 - 31 December 2014 WEG. 


Region: Global 42214 
Subject/journal group: All 


598.04 


558.30 


The table to the right includes counts of all 
research outputs for Institution name published 
between 1 January 2014 - 31 December 2014 
which are tracked by the Nature Index. 


Outputs by subject 


Below, the same research outputs are grouped by 
subject. Click on the subject to drill-down into a list 
of articles organized by journal, and then by title. 


Note: Articles may be assigned to more than one 


subject area. 


information. For example, 
articles can be displayed by 
journal, and then by article 
title. As in the supplement, 


Subject AC FC WFC 
research outputs are HE chemistry 276 «179.1 179.11 
organized by subject area. I Earth & Environmental Sciences 95 42.73 42.73 


The profile page also lists 
the institution or country’s 
top collaborators, as well as 
its relationship with other 
research organizations. 


©) Life Sciences 


|_| Physical Sciences 


representation of papers from astronomy and 
astrophysics. 

The four journals in these disciplines pub- 
lish about 50% of all papers in international 
journals in this field — approximately five- 
times the equivalent figures for other fields. 
Therefore, although the data for astronomy 
and astrophysics are compiled in exactly the 
same way as for all other disciplines, articles 
from these journals are assigned one-fifth the 
weight of other articles (i.e. the FC is multi- 
plied by 0.2 to derive the WFC). 

The total FC or WFC for an institution 
is derived by summing the FC or WFC for 
individual authors. 

The process is similar for countries, although 
complicated by the fact that some institutions 
have overseas labs that will be counted towards 
the host country totals. What’s more, there is 
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439 231.50 231.50 


652 284.48 244.74 


~ Return to institution outputs 


great variability in the way authors present 
their affiliations. Every effort is made to 
count affiliations consistently, and with the 
background of reasonable assumptions. 

For more information on how the affiliation 
information is processed and counted, 
please see the frequently asked questions at 
natureindex.com. 


THE SUPPLEMENT 

Nature Index 2015 Global is based on a 
snapshot of data from natureindex.com, 
covering articles published between 1 January 
and 31 December 2014. 

Most analyses within the supplement use 
the WFC as the primary metric, as it provides 
a more even basis for comparison, and in 
determining the relative contribution of each 
country/institution. m 
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Nature Index tables 


The Index’s leading countries and institutions for high-quality science, ordered by weighted 
fractional count (WFC) for 2014. Also shown are the total number of articles, and the 
change in WFC from 2013. Articles are from the 68 natural science journals that comprise 
the Nature Index (see Guide to the Nature Index, page S30). 


TOP 100 COUNTRIES 


ARTICLE 2013-2014 
2014 COUNTRY WFC COUNT CHANGE IN WFC 


al United States 17,936.51 26,638 18,581.10 -3.5% 

2 China 6,037.22 8,641 5,204.06 16.0% 

3 Germany 4,018.73 8,582 4,073.53 -1.3% 

4 United Kingdom 3,249.72 7,592 3,271.65 -0.7% 

5 Japan 3,200.43 4,976 3)S65150 49% 

6 France 222095) 5,243 2y232259) -0.5% 

1 Canada 1,488.90 3,226 1,480.64 0.6% 

8 Switzerland 1293.75 2115 1,169.17 10.7% 

9 South Korea 1,167.66 1,969 1,150.07 1.5% 

10 Spain 1,090.72 2,897 1,177.00 -7.3% 
a taly 1,046.98 3,052 1,078.08 -2.9% 
12 Australia 951.22 2,497 932.86 2.0% 
13 ndia 921.77 1,484 850.97 8.3% 
etherlands V57nSs 2,189 759.47 -0.3% 

15) Singapore 520.62 873 483.20 77% 
16 Sweden 514.55 1,407 496.38 3.7% 
17 srael 492.44 1,012 A7315 4.1% 
18 Taiwan 481.47 888 543.18 -11.4% 
19 Russia 370.18 1,147 344.26 7.5% 
20 Belgium 348.91 1,082 327.23 6.6% 
21 Denmark 321.26 1,036 296.85 8.2% 
22 Austria 318.98 856 27997 13197, 
23 Brazil 237.01 AID) 233.69 14% 
24 Poland 209.69 686 216.55 -3.2% 
25 Finland 194.97 590 1937 1.6% 
26 orway 149.81 415 122.42 22.4% 
at Czech Republic 126.33 3/2 118.43 6.7% 
28 Portugal 117.45 416 M23'53) -49% 
29 reland 113.91 Sei 116.73 -2.4% 
30 Argentina 98.58 321 105.55 -6.6% 
Sil ew Zealand 95.69 27,5 117.58 -18.6% 
32 Chile 95.13 857 75.43 26.1% 
33 Greece 91.77 320 90.15 1.8% 
34 Mexico 84.60 391 77.08 9.7% 
35 South Africa 81.67 418 81.62 0.1% 
36 Turkey 79.24 2or 59107 34.1% 
oy Hungary 78.52 278 54.92 43.0% 
38 Saudi Arabia 76.63 384 76.64 0.0% 
39 Iran 67.87 147 58155) 15.9% 
40 Ukraine 37.06 Ve] 28.58 29.7% 
Al Slovenia 36.36 106 43.20 -15.8% 
42 Thailand 22.69 Ek) 25109 -11.7% 
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ARTICLE 2013-2014 
2014 COUNTRY WFC COUNT CHANGE IN WFC 


43 Estonia 22/31 92 1327, 68.1% 
44 Romania 21.41 79 17.47 22.6% 
45 Croatia 17.56 102 21.86 -19.7% 
46 Serbia 17.00 55 13:95 21.9% 
47 Lithuania 16.85 58 11.26 49.7% 
48 Malaysia 15.81 54 14.14 11.8% 
49 Egypt 14.32 87 12.04 18.9% 
50 Slovakia 14.16 64 IA by/ 12.6% 
al Puerto Rico 13.66 38 11.24 21.6% 
52 Iceland 12.87 65 13.90 74% 
53 Pakistan 12.83 42 18.03 -28.8% 
54 Cyprus 10.95 28 8.90 23.1% 
55) Luxembourg 10.94 35 10'55 3.7% 
56 Uruguay 9155 22 5.21 83.5% 
57. United Arab Emirates 9.37 49 7.08 32.2% 
58 Colombia 9.36 6) 9.45 -1.0% 
59 Panama 8.84 Be 6.61 33.9% 
60 Bulgaria 8.59 63 9.49 -9.5% 
61 Armenia 8.38 30 7.98 5.0% 
62 Vietnam 53 29 25) -8.4% 
63 Belarus (Sal) 22 3.95 55.9% 
64 Indonesia 5.92 30 4.28 38.4% 
65 Kenya 5.64 25 bYS6 5.2% 
66 Qatar 4.98 35 3.96 25.6% 
67 Peru 4.92 29 4.22 16.6% 
68 Algeria 4.65 22 1.10 S22.500) 
69 Georgia 4.15 27 2.45 69.6% 
70 Costa Rica 3.50 25 1.49 134.6% 
71 Lebanon 3.08 14 3.34 7.9% 
72. Tunisia Pals 13 2.84 -2.8% 
73 Ecuador 2.48 19 2.48 0.1% 
74 Bangladesh 2.45 14 2.28 7.7% 
75) Venezuela 2.40 27 1.83 31.2% 
76 Morocco Zrii, 18 3.46 -31.4% 
77 Latvia 2.18 12 3.18 -31.4% 
78 Cuba 2.03 9 4.86 -58.2% 
79 Oman 2.00 8 129 54.8% 
80 Kazakhstan 1.93 9 2.85 -32.2% 
81 Philippines 1.82 16 3.33 -45.4% 
82 Moldova 1.68 10 3.35 -49.8% 
83 Nepal 1.64 8 0.82 99.4% 
84 Benin 1.62 6 osy 341.4% 
85 Uganda 1.61 ii 0.29 452.6% 
86 Mongolia Si i) 0.62 119.1% 
87 Malta 1.30 5) 0.46 179.4% 
88 Senegal iil 4 0.56 110.1% 
89 Cameroon 1.16 8 0.81 42.9% 
90 Uzbekistan 1.16 10 (O)al7/ 576.6% 
91 Bolivia 1.05 9 0.48 120.2% 
92 Mali 1.00 3 0.19 415.5% 
93 Macedonia 1.00 6 0.26 290.5% 
94 Kuwait 0.95 7 Sally -70.0% 
95 Namibia 0.95 4 0.09 988.5% 
96 Burkina Faso 0.93 4 0.05 1,944.6% 
S7 Azerbaijan 092 2) 0.56 63.0% 
98 Iraq 0.84 12 0.74 13.8% 
99 Sri Lanka 0.83 8 0.70 18.6% 
100 Kyrgyzstan 0.81 - 0.81 -0.2% 
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TOP 200 INSTITUTIONS 


2014 


44 


S36 | 


INSTITUTION 

Chinese Academy of Sciences (CAS) 

Harvard University 

French National Centre for Scientific Research (CNRS) 
Max Planck Society 

Stanford University (SU) 

Massachusetts Institute of Technology (MIT) 

The University of Tokyo (UTokyo) 


Helmholtz Association of German Research Centres (Helmholtz) 


University of Cambridge 

University of California Berkeley (UC Berkeley) 
University of Oxford 

Swiss Federal Institute of Technology Zurich (ETH Zurich) 
National Institutes of Health (NIH) 

University of California, San Diego (UC San Diego) 
Yale University 

University of Michigan (U-M) 

Peking University (PKU) 

Kyoto University 

The University of Texas at Austin (UT Austin) 
California Institute of Technology (Caltech) 
University of Toronto (U of T) 

University of California Los Angeles (UCLA) 
University of Washington (UW) 

Northwestern University (NU) 

Columbia University in the City of New York (CU) 
University of Pennsylvania (Penn) 

Spanish National Research Council (CSIC) 
University of Wisconsin-Madison (UW Madison) 
Swiss Federal Institute of Technology in Lausanne (EPFL) 
he Johns Hopkins University IHU) 


ornell University 


T 
Cc 
Osaka University 
The University of Chicago (UChicago) 
Princeton University 
University of Illinois at Urbana Champaign (UIUC) 
Tsinghua University (TH) 
anjing University (NJU) 
University of California Santa Barbara (UCSB) 
mperial College London (ICL) 
anyang Technological University (NTU) 
University of Science and Technology of China (USTC) 
ational University of Singapore (NUS) 
Zhejiang University (ZJU) 
University of California San Francisco (UCSF) 
University College London (UCL) 
Tohoku University 
University of North Carolina at Chapel Hill (UNC) 


Russian Academy of Sciences (RAS) 


Pennsylvania State University (Penn State) 
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COUNTRY 
China 
United States 
France 
Germany 
United States 
United States 
Japan 
Germany 
United Kingdom 
United States 
United Kingdom 
Switzerland 
United States 
United States 
United States 
United States 
China 
Japan 
United States 
United States 
Canada 
United States 
United States 
United States 
United States 
United States 
Spain 
United States 
Switzerland 
United States 
United States 
Japan 
United States 
United States 
United States 
China 
China 
United States 
United Kingdom 
Singapore 
China 
Singapore 
China 
United States 
United Kingdom 
Japan 
United States 
Russia 


United States 


WFC 
1,307.74 
865.09 
761.48 
648.86 
538.57 
501.38 
456.23 
440.09 
403.11 
365.55 
355199 
347.29 
342.43 
334.34 
303.63 
298.80 
2Msy/ 
286.15 
274.17 
270.83 
268.61 
260.59 
259.61 
25934) 
258213 
252.97 
242.90 
232.81 
232.70 
226.16 
226.04 
218.77 
208.64 
207.12 
206.36 
205.58 
202.30 
201.06 
200.52 
199.86 
TgIR75) 
190.30 
188.48 
188.02 
185.66 
178.14 
177.36 
171.47 
171.14 
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ARTICLE 
COUNT 


3,124 
2,615 
4,907 
2,950 
1,265 
1,483 
1,251 
1,629 
1,384 
1,236 
1,179 
978 
771 
832 
896 
855 
937 
716 
629 
1199 
829 
744 
766 
586 
851 
589 
1846 
654 
687 
783 
689 
555 
724 
680 
551 
611 
474 
552 
Ted 
371 
541 
465 
352 
446 
799 
426 
387 
910 
587 


2013 
WFC 


1,209.37 
852.85 
719.07 
727.32 
550.20 
507.17 
472.88 
422.02 
401.39 
362.16 
382.41 
275.31 
391.02 
344.80 
289.56 
344.95 
275.46 
Sisto 
246.84 
280.77 
256.09 
286.64 
248.89 
298.02 
25732 
295.67 
240.00 
251.98 
207.58 
223001 
247.53 
219.88 
190.48 
230.61 
229.86 
195.00 
196.00 
196.04 
185.01 
187.38 
IAS 7/5} 
191.76 
150.42 
221.76 
183.78 
189.32 
170.63 
165.96 
185.84 


CHANGE IN WFC 
8.1% 
1.4% 
5.9% 

-10.8% 
-2.1% 
-1.1% 
-3.5% 

4.3% 
0.4% 
0.9% 
-6.9% 
26.1% 

-12.4% 

-3.0% 
4.9% 
-13.4% 
5.8% 
-8.7% 
11.1% 
-3.5% 
4.9% 
-9.1% 
4.3% 
-13.0% 
0.3% 
-14.4% 
1.2% 
-7.6% 
12.1% 
1.0% 
-8.7% 
-0.5% 
9.5% 
-10.2% 
-10.2% 
5.4% 
3.2% 
2.6% 
84% 
6.7% 
9.1% 
-0.8% 
25.3% 
-15.2% 
1.0% 
5.9% 
3.9% 
3.3% 
-7.9% 


GLOBAL 
ARTICLE 2013 2013-2014 
2014 INSTITUTION COUNTRY WFC COUNT WFC 


50 University of California Davis (UC Davis) United States 169.69 478 183.16 74% 
Bill Indian Institutes of Technology (IITs) India 168.18 288 128.43 30.9% 
52 Duke University United States 167.60 501 173.44 -3.4% 
53 Fudan University China 162.59 344 129/22 25.8% 
54 University of Maryland, College Park (UMCP) United States 162.48 740 161.78 0.4% 
5) Washington University in St. Louis (WUSTL) United States 161.42 379 173.04 -6.7% 
56 Lawrence Berkeley National Laboratory (LBNL) United States 161.03 781 153.57 A9% 
57 Seoul National University (SNU) South Korea 160.31 429 158.14 14% 
58 University of Minnesota (UMN) United States 158.05 491 178.08 -11.2% 
59 Ludwig Maximilian University of Munich (LMU) Germany MSA 627 154.25 2.2% 
60 Rutgers, The State University of New Jersey (RU) United States 148.48 455 154.02 -3.6% 
61 RIKEN Japan 144.43 497 165.96 -13.0% 
62 New York University (NYU) United States 144.08 431 139/91 3.0% 
63 University of Colorado Boulder (CU-Boulder) United States 143.58 553 152.16 -5.6% 
64 Weizmann Institute of Science (WIS) Israel 142.96 296 150.96 -5.3% 
65 The Ohio State University (OSU) United States 142.01 65) 142.91 -0.6% 
66 The University of Manchester (UoM) United Kingdom 141.03 565 125.06 12.8% 
67 The Scripps Research Institute (TSRI) United States 134.75 288 178.38 -24.5% 
68 McGill University Canada SS) 3)7/ 468 136.55 -2.2% 
69 National Research Council (CNR) Italy 130.31 504 150.02 -13.1% 
70 The University of British Columbia (UBC) Canada 130.25 463 130.64 -0.3% 
71 University of California Irvine (UCI) United States 130.13 405 147.32 -11.7% 
W2 Council of Scientific and Industrial Research (CSIR) India 128.64 196 115.24 11.6% 
73 Texas A&M University (TAMU) United States 128.34 394 147.32 -12.9% 
74 Leibniz Association Germany 128.11 540 117.56 9.0% 
75 Tokyo Institute of Technology (Tokyo Tech) Japan 12723 cell 105.59 20.5% 
76 University of Bristol (UoB) United Kingdom 15.773 399 109.62 14.7% 
Ui Nagoya University Japan 125.28 432 132.97 -5.8% 
78 University of Copenhagen (UCPH) Denmark 124.84 615 117.29 6.4% 
as) Purdue University United States Iss0 349 112.74 9.4% 
80 University of Pittsburgh (Pitt) United States 122.10 380 148.22 -17.6% 
81 Technical University Munich (TUM) Germany 121.39 477 125.21 -3.1% 
82 Vanderbilt University (VU) United States 171.38) 368 101.83 19.2% 
83 The University of Texas Southwestern Medical Center (UT Southwestern Medical Center) United States 117.93 229 124.11 -5.0% 
84 Korea Advanced Institute of Science and Technology (KAIST) South Korea 116.39 228 103.03 13.0% 
85 Argonne National Laboratory (ANL) United States 113169 475 113.74 0.0% 
86 Atomic Energy and Alternative Energies Commission (CEA) France 112'95 1080 138.71 -18.6% 
Sy The University of Edinburgh United Kingdom 112.04 796 110.44 14% 
88 Indiana University (IU) United States 111.67 354 119.17 -6.3% 
89 University of Southern California (USC) United States 159 270 127.44 -12.4% 
90 Heidelberg University (Uni Heidelberg) Germany iiss 631 105.66 5.4% 
ol University of Geneva (UNIGE) Switzerland 111.18 446 108.07 2.9% 
oe The University of Queensland (UQ) Australia 108.83 392 94.78 14.8% 
93 Georgia Institute of Technology (Georgia Tech) United States 108.31 267 137.43 -21.2% 
94 Emory University United States 107.54 209 108.36 -0.8% 
95 National Taiwan University (NTU) Taiwan 107.49 P2538) 102.57 4.8% 
96 University of Florida (UF) United States 106.90 414 105.28 1.5% 
S7 Shanghai Jiao Tong University (SJTU) China 106.84 282 96.01 11.3% 
98 Lanzhou University (LZU) China 106.31 176 69.71 52.5% 
99 University of Utah (Utah) United States 105.95 282 113.06 -6.3% 
100 Monash University Australia HOSE! 382 94.89 8.7% 
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GLOBAL 
ARTICLE 2013 2013-2014 
2014 INSTITUTION COUNTRY WFC COUNT WFC 


101 Los Alamos National Laboratory (LANL) United States 102.26 S/o 115.64 -11.6% 
102 University of Minster (WWU) Germany 101.94 ek 94.17 8.2% 
103 Jilin University JLU) China 101.87 184 97.72 4.2% 
104 University of Zurich (UZH) Switzerland 101.37 422 104.36 -2.9% 
105 National Institute for Material Science (NIMS) Japan 100.71 240 Ey) 23.1% 
106 Pierre and Marie Curie University (UPMC) France 100.44 1372 96.46 4.1% 
107 Agency for Science, Technology and Research (A*STAR) Singapore 100.33 284 81.62 22.9% 
108 Hokkaido University (HU) Japan 100.03 234 105.93 -5.6% 
109 Utrecht University (UU) Netherlands 99°89 475 111.50) -10.4% 
110 Kyushu University Japan 98.31 2H5) 84.56 16.3% 
111 Rice University United States 97.96 275 82.53 18.7% 
112 University of Alberta (U of A) Canada 97.78 295 101.74 -3.9% 
dis University of Gottingen Germany 97.69 369 98.58 -0.9% 
114 Hebrew University of Jerusalem (HUJI) Israel 95.18 245 91.30 4.3% 
SS) University of California Riverside (UCR) United States 94.82 279 TIAL 22.0% 
116 University of Groningen (RUG) Netherlands 94.80 425 74.72 26.9% 
AY, University of Wirzburg Germany 93.86 260 95.03 -1.2% 
118 ndian Institute of Science (IISC) India 93.72 149 83.64 12.0% 
119 ankai University (NKU) China 93.09 222 113.52 -18.0% 
120 Xiamen University (XMU) China 82.73 204 76.02 22.0% 
a2] ational Institute of Standards and Technology (NIST) United States 92.54 229 88.53 4.5% 
122 University of Rochester (UR) United States 92.34 291 104.73 -11.8% 
123 Boston University (BU) United States 92.14 389 99.12 -7.0% 
124 University of Basel Switzerland 90.89 251 87.67 3.7% 
125 Wuhan University (WHU) China 90.67 154 98.80 -8.2% 
126 Sichuan University (SCU) China 89.44 167 76.83 16.4% 
iy University of Leuven (KU Leuven) Belgium 88.93 393 91.46 -2.8% 
128 National Aeronautics and Space Administration (NASA) United States 88.79 1,006 94.01 -5.6% 
129 Memorial Sloan-Kettering Cancer Center (MSKCC) United States 88.29 rallge 7os5 17.5% 
130 University of New South Wales (UNSW) Australia 88.11 2i7 69.81 26.2% 
131 University of Chinese Academy of Sciences (UCAS) China 88.05 516 71.24 23.6% 
132 Sun Yat-sen University (SYSU) China 87.87 en 79.43 10.6% 
133 The University of Arizona (UA) United States 8757 522 78.02 12.2% 
134 Australian National University (ANU) Australia 87.16 498 93.53 -6.8% 
Hes) Tel Aviv University (TAU) Israel 86.99 318 78.16 113% 
136 Soochow University China 86.12 158 65.31 31.9% 
eas Baylor College of Medicine (BCM) United States 85.63 els 7615 12.5% 
138 Aarhus University (AU) Denmark 85.28 294 S135 10.3% 
so University of Erlangen-Nuremberg (FAU) Germany 84.46 248 84.55 -0.1% 
140 Oak Ridge National Laboratory (ORNL) United States 84.36 269 86.77 -2.8% 
141 Brown University United States 83.87 245 Tlseul 17.3% 
142 Humboldt University of Berlin (HU Berlin) Germany Sort 356 76.98 8.7% 
143 The University of Melbourne (UniMelb) Australia 82.60 478 89.76 -8.0% 
144 Academia Sinica Taiwan 81.82 476 80.21 2.0% 
145 U.S. Department of Energy (DOE) United States 81.44 17s 74.77 8.9% 
146 The University of Virginia (UVA) United States Si30 319) Se 4.5% 
147 Lund University (LU) Sweden 81.10 328 7299 11.1% 
148 East China Normal University (ECNU) China 80.63 144 65.56 23.0% 
149 National Institute for Health and Medical Research (INSERM) France 80.44 654 70.08 14.8% 
150 University of Bonn (Uni Bonn) Germany 80.37 464 78.01 3.0% 
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GLOBAL 
ARTICLE 2013 2013-2014 
2014 INSTITUTION COUNTRY WFC COUNT WFC 


dpi Arizona State University (ASU) United States 79.94 255) 80.58 -0.8% 

152 The University of Sydney (USYD) Australia 19.89 492 95.14 -16.0% 

153 University of Southampton (Soton) United Kingdom 79.04 315 85.53 -76% 
154 Technion-lIsrael Institute of Technology (IIT) Israel 78.91 231 67.90 16.2% 
155 University of Montreal Canada 78.01 B27 68.73 13.5% 
156 Stony Brook University (SBU) United States 77.80 302 TE79 -1.3% 
157 Brookhaven National Laboratory (BNL) United States 1/83 323 67.83 14.4% 
158 University of Freiburg Germany EL39 282 F275 6.4% 
159 Uppsala University (UU) Sweden 76.75 Sel 80.05 -4.1% 
160 Durham University United Kingdom 76.61 386 84.05 -8.8% 
161 The University of Nottingham (UoN) United Kingdom 76.54 236 74.17 3.2% 
162 University of Hamburg (UHH) Germany 76.48 361 83.82 -8.8% 
163 Indian Institute of Science Education and Research (IISER) India 75.88 106 58.72 29.2% 
164 RWTH Aachen University (RWTH Aachen) Germany Ue 205 87.18 -13.3% 
165 Case Western Reserve University (CWRU) United States 1533 201 76.24 -1.2% 
166 Joseph Fourier University (UJF) France 74.84 682 69.43 7.8% 
167 Leiden University Netherlands 73.69 482 73.08 0.8% 
168 Hong Kong University of Science and Technology (HKUST) China 13.09 133 54.60 34.8% 
169 Hunan University (HNU) China 73.23 105 54.57 34.2% 
170 lowa State University (ISU) United States 42.56 208 78.91 -8.0% 
AZAL Medical Research Council (MRC) United Kingdom FLTC 470 Orly 6.8% 
Ly The University of Hong Kong (HKU) China WAled/S) 185 7138 0.5% 
173 Radboud University Nijmegen (RU) Netherlands wis 461 63.38 13.2% 
174 University of Waterloo Canada 71.02 199 61.22 16.0% 
175 Yonsei University South Korea 7102 201 61.62 153% 
176 Karolinska Institute (Kl) Sweden 70.88 234 59123 19.7% 
IAT University of Massachusetts Medical School (UMMS) United States 70.59 143 72.00 -2.0% 
178 Pohang University of Science and Technology (POSTECH) South Korea 70.38 151 70.00 0.5% 
179 University of Notre Dame United States 10°37 258 86.33 -18.5% 
180 University of Cologne (UoC) Germany 69.90 235 ByAiIS} 22.3% 
181 University of California Santa Cruz (UCSC) United States 69.85 Soy 58.03 20.4% 
182 National Institute of Advanced Industrial Science and Technology (AIST) Japan 69/9) 185 79.48 -12.2% 
183 University of Warwick (Warwick) United Kingdom 69.68 392 61.07 14.1% 
184 King's College London (KCL) United Kingdom 69:57 209 71.59 -2.8% 
185 The Rockefeller University United States 69.55: 207 82.62 -15.8% 
186 University of Barcelona (UB) Spain 68.98 608 72.14 -44% 
187 Xi'an Jiaotong University (XJTU) China 67.98 167 42.98 58.2% 
188 Stockholm University Sweden 67.67 416 71.69 -5.6% 
189 East China University of Science and Technology (ECUST) China 67.48 i) 56.75 18.9% 
190 University of Sheffield United Kingdom 66.68 266 67.59 -1.3% 
191 University of Tubingen Germany 66.65 219 7743 -13.9% 
192 University of Amsterdam (UvA) Netherlands 66.56 532 60.41 10.2% 
193 University of Paris Sud (Paris 11) France 65.88 611 12233 -8.9% 
194 University of Glasgow United Kingdom 65.30 539 66.48 -1.8% 
195 National Tsing Hua University (NTHU) Taiwan 65.28 7A 66.23 -14% 
196 University of Ottawa (uOttawa) Canada 64.43 147 59120 8.8% 
197 The University of Texas MD Anderson Cancer Center United States 64.21 178 65.99 -2.7% 
198 Icahn School of Medicine at Mount Sinai (ISMMS) United States 64.14 200 53.91 19.0% 
199 University of Regensburg (UR) Germany 63.91 13] 53.73 18.9% 
200 Technische Universitat Berlin (TU Berlin) Germany 63.52 141 55.40 14.7% 
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TOP INSTITUTIONS: LIFE SCIENCES 


i Be © Bey ~ fe oo Bee» 


ig 


S40 


Harvard University 

National Institutes of Health (NIH) 

Stanford University (SU) 

Max Planck Society 

Massachusetts Institute of Technology (MIT) 

Yale University 

University of California San Francisco (UCSF) 
University of California, San Diego (UC San Diego) 
University of Toronto (U of T) 

Chinese Academy of Sciences (CAS) 

French National Centre for Scientific Research (CNRS) 
University of Oxford 

The Johns Hopkins University JHU) 

Columbia University in the City of New York (CU) 
University of Pennsylvania (Penn) 

University of Michigan (U-M) 

University of Cambridge 

University of Washington (UW) 

University of California Los Angeles (UCLA) 


University of California Berkeley (UC Berkeley) 
Washington University in St. Louis (WUSTL) 
Cornell University 

Duke University 

University College London (UCL) 


The University of Texas Southwestern Medical Center (UT Southwestern Medical Center) 


The University of Tokyo (UTokyo) 
New York University (NYU) 
University of North Carolina at Chapel Hill (UNC) 


The University of Chicago (UChicago) 


University of Wisconsin-Madison (UW Madison) 


Helmholtz Association of German Research Centres (Helmholtz) 


Baylor College of Medicine (BCM) 
Vanderbilt University (VU) 


University of California Davis (UC Davis) 


Memorial Sloan-Kettering Cancer Center (MSKCC) 

Swiss Federal Institute of Technology Zurich (ETH Zurich) 
Northwestern University (NU) 

Emory University 

University of Pittsburgh (Pitt) 

National Institute for Health and Medical Research (INSERM) 
The Scripps Research Institute (TSRI) 

McGill University 

Rutgers, The State University of New Jersey (RU) 
University of Massachusetts Medical School (UMMS) 
Spanish National Research Council (CSIC) 

The Rockefeller University 

Karolinska Institute (Kl) 

Imperial College London (ICL) 

The University of British Columbia (UBC) 

Icahn School of Medicine at Mount Sinai (ISMMS) 
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United States 
United States 
United States 
Germany 
United States 
United States 


United States 


United States 
Canada 
China 
France 
United Kingdom 
United States 
United States 
United States 
United States 
United Kingdom 
United States 
United States 
United States 
United States 
United States 
United States 
United Kingdom 
United States 
Japan 
United States 
United States 
United States 


United States 


Germany 
United States 
United States 
United States 
United States 
Switzerland 
United States 
United States 
United States 
France 
United States 
Canada 
United States 
United States 


Spain 


United States 
Sweden 
United Kingdom 
Canada 


United States 


610.16 
320.93 
226.07 
225.42 
182.86 
179.07 
177.02 
168.43 
157-03 
165.22 
155.06 
52.07 
46.19 
142.53 
141.62 
133.65 
138°57, 
125.90 
123.06 
121,51 
121.40 
13.45 
112.08 
106.93 
104.98 
o9.33 
99.21 
97,89 
94.05 
92.41 
90.06 
84.40 
80.27 
79.61 
7702 
76.08 
75.07 
74.39 
73.58 
wiles 
71.09 
71.03 
71.00 
66.97 
66.91 
66.68 
65.96 
65.08 
64.55 
63.95 
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207 


575 
1193 
208 
163 
eye 
295 
146 
222 
244 
175) 
198 


587.65 
369.99 
245.14 
216.73 
183.17 
189.31 
Bie77 
182.69 
128.58 
147.94 
145.94 
147.83 
140.18 
127.22 
175.58 
143.00 
155.77 
137.32 
26.98 
126.58 
137.34 
140.95 
113.86 
115.85 
107.98 
112.60 
92.73 
93.85 
93.92 
87.38 
85.65 
74.30 
74.80 
90.61 
68.47 
46.48 
91.03 
66.89 
81.12 
62.47 
95.51 
78.78 
61.90 
69.40 
62.94 
77.26 
55.50 
54.58 
53.90 
52.10 


I 
i 
N 

Pl 


TOP INSTITUTIONS: EARTH & ENVIRONMENTAL SCIENCES 
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13 


Chinese Academy of Sciences (CAS) 

French National Centre for Scientific Research (CNRS) 
Helmholtz Association of German Research Centres (Helmholtz) 
University of California, San Diego (UC San Diego) 
National Aeronautics and Space Administration (NASA) 
National Oceanic and Atmospheric Administration (NOAA) 
California Institute of Technology (Caltech) 

University of Washington (UW) 

Swiss Federal Institute of Technology Zurich (ETH Zurich) 
Woods Hole Oceanographic Institution (WHOl) 

U.S. Geological Survey (USGS) 

The University of Texas at Austin (UT Austin) 


Massachusetts Institute of Technology (MIT) 


University of Colorado Boulder (CU-Boulder) 

The University of Tokyo (UTokyo) 

Columbia University in the City of New York (CU) 
Pennsylvania State University (Penn State) 
University of California Berkeley (UC Berkeley) 
University of Cambridge 

National Center for Atmospheric Research (NCAR) 
Stanford University (SU) 

niversity of Hawaii at Manoa (UH Manoa) 


trecht University (UU) 


— Cc ie 


niversity of California Davis (UC Davis) 


co 


c 


niversity of Maryland, College Park (UMCP) 

Princeton University 

Institute of Research for Development (IRD) 

Spanish National Research Council (CSIC) 

Yale University 

Australian National University (ANU) 

University of Bristol (UoB) 

University of Wisconsin-Madison (UW Madison) 
University of Oxford 

Harvard University 

University of California Los Angeles (UCLA) 

Max Planck Society 

Oregon State University (OSU) 

The University of Arizona (UA) 

University of Bremen 

University of Toronto (U of T) 
National Institute of Geophysics and Volcanology (INGV) 
Brown University 

University of Minnesota (UMN) 
Colorado State University (CSU) 
T 
Pp 
U 
U 
U 


aris Diderot University (Paris 7) 
niversity of Florida (UF) 
niversity of Miami (UM) 


.S. Department of Energy (DOE) 


apan Agency for Marine-Earth Science and Technology JAMSTEC) 


he Commonwealth Scientific and Industrial Research Organisation (CSIRO) 


China 
France 
Germany 
United States 
United States 
United States 
United States 
United States 
Switzerland 
United States 
United States 
United States 
United States 
United States 

Japan 
United States 
United States 
United States 

United Kingdom 
United States 
United States 
United States 
Netherlands 
United States 

Japan 
United States 
United States 


France 
Spain 
United States 
Australia 
United Kingdom 
United States 
United Kingdom 
United States 
United States 

Germany 
United States 


United States 
Germany 
Canada 
Italy 
United States 
United States 
United States 
Australia 
France 
United States 
United States 
United States 


74.88 
74.14 
68.96 
54.32 
52.68 
50.32 
50.17 
49.12 
41.69 
41.31 

38.62 
37.58 
37.09 
36.78 
36.65 
SZ05 
32.14 
3005 
28.08 
25.08 
24.65 
23.86 
23.00 
22.96 
22.61 
2250 
22.41 
22.18 
22.02 
21.88 
20.98 
20.51 
20.22 
20.05 
19.46 
19.05 
18.91 

18.91 
18.19 
13.7 
18.16 
17.95) 
1795 
17/93) 
i75il 

17.34 
17.07 
16.88 
16.25 
16.04 
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11S 


104 


100 
132 


105 


GLOBAL 


TT 
56.92 
63.81 
47.45 
57.95 
59.91 
50.91 
42.34 
33.1% 
31.00 
39.76 
Palecrt 
20.62 
44.08 
3/33 
41.25 
20.15 
2297 
18.75 
25.07 
32.09 
28.04 
21,31 
16.60 
28.62 
15.96 
20.74 
16.82 
18.56 
IAS) 
18.55 
17.84 
24.37 
33.37 
32.26 
20.81 
23.81 
197 
15.43 
9.65 
12.10 
18.73 
8.59 
‘lle IU} 
17.98 
16.92 
Syily/ 
16.78 
10.15 
16.44 


-2.9% 
30.2% 
8.1% 
14.5% 
-9.1% 
-16.0% 
-1.5% 
16.0% 
25.7% 
33.3% 
-2.9% 
78.0% 
79.8% 
-16.6% 
-1.8% 
-20.8% 
59.5% 
34.8% 
49.7% 
1.2% 
-23.2% 
-14.9% 
8.0% 
38.3% 
-21.0% 
41.1% 
8.1% 
31.8% 
18.6% 
25.0% 
13.1% 
15.0% 
-17.0% 
-40.8% 
-39.7% 
-8.5% 
-20.6% 
-4.4% 
179% 
88.4% 
50.1% 
-4.2% 
108.9% 
18.4% 
-2.6% 
24% 
108.9% 
0.6% 
60.1% 
-2.4% 


NATURE INDEX 2015 | GLOBAL | 


S41 


NATURE INDEX FEET 
TOP INSTITUTIONS: CHEMISTRY 


ARTICLE 2013-2014 
2014 —‘ INSTITUTION COUNTRY WFC COUNT CHANGE IN WFC 


il Chinese Academy of Sciences (CAS) China le2a} 1,396 681.87 See 
2 French National Centre for Scientific Research (CNRS) France 250.51 921 22197 10.3% 
3 Max Planck Society Germany 184.21 425 202.85 -9.2% 
4 Massachusetts Institute of Technology (MIT) United States 169.74 274 164.75 3.0% 
iS) Stanford University (SU) United States 159,72 254 145.69 9.6% 
6 Peking University (PKU) China 151.32 366 142.68 6.1% 
7 Kyoto University Japan 151.26 233i, 5y7Al2 -3.7% 
8 The University of Tokyo (UTokyo) Japan 143.72 228 159.48 -9.9% 
9 anyang Technological University (NTU) Singapore 143.06 217 123.90 15.5% 
10 Harvard University United States 141.23 256 143.52 -1.6% 
11 University of California Berkeley (UC Berkeley) United States 140.86 289 131.56 7.1% 
12 orthwestern University (NU) United States 136.36 211 15292 -10.8% 
is} Swiss Federal Institute of Technology Zurich (ETH Zurich) Switzerland 127.29 217 102.80 23.8% 
14 University of Oxford United Kingdom 126.10 202 11939 5.6% 
i) Zhejiang University (ZJU) China 125.25 178 86.42 44.9% 
16 Nanjing University (NJU) China 120.32 207 117.34 2.5% 
WW, Swiss Federal Institute of Technology in Lausanne (EPFL) Switzerland 116.01 219 80.35 44.4% 
18 Fudan University China 114.02 186 80.10 42.3% 
19 The University of Texas at Austin (UT Austin) United States 112.69 161 109.67 2.7% 
20 University of Cambridge United Kingdom 110.12 186 105.84 4.0% 
Pil University of Illinois at Urbana Champaign (UIUC) United States 108.91 191 115.67 -5.8% 
oe Helmholtz Association of German Research Centres (Helmholtz) Germany 108.35 302 91.62 18.3% 
23 University of Science and Technology of China (USTC) China 108.19 223 93.78 15.4% 
24 ndian Institutes of Technology (IITs) India 106.58 141 74.77 42.6% 
25 Osaka University Japan 105.69 ysl 106.33 -0.6% 
26 Tsinghua University (TH) China 101.79 201 92.06 10.6% 
Zai/ University of Wisconsin-Madison (UW Madison) United States 96.53 139 105.23 -8.3% 
28 ational University of Singapore (NUS) Singapore 93.39 iy 83.81 11.4% 
29 California Institute of Technology (Caltech) United States ©2335) 144 107.31 -13.9% 
30 Council of Scientific and Industrial Research (CSIR) India 90.40 ial 79.46 13.8% 
iil Lawrence Berkeley National Laboratory (LBNL) United States 85.03 248 74.57 14.0% 
32 Lanzhou University (LZU) China 84.31 118 5035 67.5% 
33 University of California Santa Barbara (UCSB) United States 83.37 137, 62.14 34.2% 
34 University of Pennsylvania (Penn) United States 82.19 124 80.23 24% 
315) Tohoku University Japan 81.32 138 62.89 29.3% 
36 The Scripps Research Institute (TSRI) United States 79.61 130 100,52 -21.6% 
37 University of California, San Diego (UC San Diego) United States TAO7 134 78.52 0.7% 
38 Jilin University (JLU) China rv Aows 128 73.88 5.0% 
39 University of California Los Angeles (UCLA) United States 75.45 134 87.43 -13.7% 
40 University of North Carolina at Chapel Hill (UNC) United States 74.34 101 75108 -1.0% 
Al The University of Manchester (UoM) United Kingdom 73.82 132 53.06 39.1% 
42 Imperial College London (ICL) United Kingdom 73.20 126 62.12 17.8% 
43 Korea Advanced Institute of Science and Technology (KAIST) South Korea 73.03 133 56:93 28.3% 
44 University of Michigan (U-M) United States 72.80 112 92.32 -21.1% 
45 Xiamen University (CMU) China 71.25 138 59.87 19.0% 
46 Ludwig Maximilian University of Munich (LMU) Germany 71.20 130 BoZ2 28.9% 
47 Yale University United States 7116 122 46.80 51.9% 
48 Sichuan University (SCU) China FOrS 98 68.45 3.4% 
49 University of Toronto (U of T) Canada 69.01 120 85.42 -19.2% 
50 Seoul National University (SNU) South Korea 68.64 111 70.70 -2.9% 
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TOP INSTITUTIONS: PHYSICAL SCIENCES 
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Chinese Academy of Sciences (CAS) 

French National Centre for Scientific Research (CNRS) 
Max Planck Society 

Stanford University (SU) 


Helmholtz Association of German Research Centres (Helmholtz) 


Massachusetts Institute of Technology (MIT) 
The University of Tokyo (UTokyo) 
Harvard University 
University of Cambridge 
University of California Berkeley (UC Berkeley) 
Swiss Federal Institute of Technology Zurich (ETH Zurich) 
University of California Santa Barbara (UCSB) 
University of Oxford 
University of Michigan (U-M) 
Peking University (PKU) 
Russian Academy of Sciences (RAS) 
Spanish National Research Council (CSIC) 
Swiss Federal Institute of Technology in Lausanne (EPFL) 
Princeton University 
The University of Texas at Austin (UT Austin) 
Tsinghua University (TH) 
California Institute of Technology (Caltech) 
University of Maryland, College Park (UMCP) 
University of Illinois at Urbana-Champaign (UIUC) 
Lawrence Berkeley National Laboratory (LBNL) 
University of California Los Angeles (UCLA) 
National University of Singapore (NUS) 
Kyoto University 
Los Alamos National Laboratory (LANL) 
National Research Council (CNR) 
Tohoku University 
The University of Chicago (UChicago) 
University of Science and Technology of China (USTC) 
University of California, San Diego (UC San Diego) 
anyang Technological University (NTU) 
Seoul National University (SNU) 
mperial College London (ICL) 
orthwestern University (NU) 
Pennsylvania State University (Penn State) 
Cornell University 
ational Institute of Standards and Technology (NIST) 
anjing University (NJU) 


ational Institute for Material Science (NIMS) 

Osaka University 

Yale University 

University of Colorado Boulder (CU-Boulder) 

Atomic Energy and Alternative Energies Commission (CEA) 
Argonne National Laboratory (ANL) 

Columbia University in the City of New York (CU) 


Korea Advanced Institute of Science and Technology (KAIST) 


China 
France 
Germany 
United States 
Germany 
United States 
Japan 
United States 
United Kingdom 
United States 
Switzerland 
United States 
United Kingdom 
United States 
China 
Russia 
Spain 
Switzerland 
United States 
United States 
China 
United States 
United States 
United States 
United States 
United States 


Singapore 
Japan 
United States 
Italy 
Japan 
United States 
China 
United States 
Singapore 
South Korea 
United Kingdom 
United States 
United States 
United States 
United States 
China 
Japan 
Japan 
United States 
United States 
France 
United States 
United States 


South Korea 


413.44 
333.36 
300.72 
221.86 
213.14 
212.29 
204.65 
200.80 
87.59 
149.28 
40.09 
115.63 
113.00 
112.36 
IMME 7, 
111.96 
08.99 
104.69 
102.49 
98.37 
98.25 
95:56 
90.92 
86.99 
86.75 
81.40 
80.80 


VMs) 
76.30 
75.80 
75165: 
75.47 
74.91 
74.21 
72.88 
72.34 
71.34 
Hales 
68.96 
68.66 
68.50 
68.43 
68.14 
err 
66.85 
66.06 
65.66 
bs.o2 
61.89 
58.92 
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1376 406.86 
2,812 335.41 
1,948 353.91 
652 211.18 
819 217.24 
718 226.42 
737. 193.50 
1,149 191.30 
839 170.15 
735 136.57 
520 132.35 
ary 120.80 
594 126.23 
464 121.38 
454 105.09 
724 109.38 
1,281 98.40 
A401 102.90 
469 114.36 
330 98.67 
363 87.94 
839 100.29 
542 96.06 
297 111.83 
514 76.52 
314 97.72 
171 88.74 
330 91:02 
289 84.24 
282 93.70 
all 56.92 
392 63.01 
301 71.40 
259 en! 
141 71.87 
242 66.08 
370 79.35 
209 91.07 
369 7744 
265 73.02 
254 67.83 
233 64.94 
166 43.97 
245 62.61 
382 60.96 
338 55.85 
765 91.23 
213 TAME) 
Soil 74.99 
103 49.76 


16% 
-0.6% 
-15.0% 
5.1% 
-19% 
6.2% 
5.8% 
5.0% 
10.3% 
10.1% 
5.8% 
-4.3% 
-10.5% 
7.4% 
6.6% 
24% 
10.8% 
1.7% 
10.4% 
-0.3% 
11.7% 
-4.7% 
5.4% 
-22.2% 
13.4% 
-16.7% 
-8.9% 
15.2% 
-9.4% 
-19.1% 
-22.0% 
19.8% 
49% 
0.7% 
14% 
9.5% 
-10.1% 
-21.7% 
-10.9% 
-6.0% 
1.0% 
5.4% 
55.0% 
8.1% 
9.7% 
18.3% 
-28.0% 
10.4% 
-17.5% 
18.4% 
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TOP INSTITUTIONS IN NATURE AND SCIENCE JOURNALS 


ARTICLE 2013-2014 
2014 ~— INSTITUTION COUNTRY WFC COUNT CHANGE IN WFC 


1 Harvard University United States 75.29 210 76.41 -1.3% 
2 Stanford University (SU) United States 41.44 103 38.30 8.2% 
3 Massachusetts Institute of Technology (MIT) United States 34.13 123 45.59 -25.1% 
4 Max Planck Society Germany 30.40 120 32.87 -7.5% 
By National Institutes of Health (NIH) United States 24.04 69 27:57 -12.8% 
6 University of California Berkeley (UC Berkeley) United States 21.35 25 21.58 -1.1% 
7 University of Washington (UW) United States 20.63 63 14.49 42.3% 
8 University of California, San Diego (UC San Diego) United States 20.29 54 17.83 13.9% 
9 Yale University United States 20.16 56 17.04 18.3% 
0) French National Centre for Scientific Research (CNRS) France 19.46 166 15.12 28.7% 
li California Institute of Technology (Caltech) United States 18.61 62 T2973 46.2% 
12 University of Cambridge United Kingdom 18.12 82 20.69 -12.4% 
3 University of California San Francisco (UCSF) United States 17.96 48 24.04 -25.3% 
14 University of Oxford United Kingdom 77S) 60 13.84 28.5% 
15) Columbia University in the City of New York (CU) United States 17.45 64 1975 -11.6% 
16 Helmholtz Association of German Research Centres (Helmholtz) Germany 16.66 80 10.95 52.1% 
y/ The Scripps Research Institute (TSRI) United States 14.18 35 18.43 -23.1% 
18 The University of Tokyo (UTokyo) Japan 13.21 44 1555 -15.1% 
19 Chinese Academy of Sciences (CAS) China Heel 58 18.7) -29.9% 
20 University of Michigan (U-M) United States 13.04 44 9.36 39.3% 
21 Weizmann Institute of Science (WIS) Israel 12.91 22 6.67 93.5% 
22 Swiss Federal Institute of Technology Zurich (ETH Zurich) Switzerland 12.74 43 13.98 -8.9% 
23 Princeton University United States 12:29 32 13.50 -9.0% 
24 The University of Texas at Austin (UT Austin) United States 12.20) 40 4.60 165.3% 
25 Cornell University United States 11.83 leis) 13131 -11.1% 
26 The Johns Hopkins University HU) United States 11.82 39 1233 -4.1% 
Pas Washington University in St. Louis (WUSTL) United States 11.40 40 8.41 35.6% 
28 Medical Research Council (MRC) United Kingdom 11.34 24 9.29 22.0% 
29 University of Wisconsin—Madison (UW Madison) United States 11.14 35 3.16 252.1% 
30 Memorial Sloan-Kettering Cancer Center (MSKCC) United States 10.81 29 5.08 112.9% 
ey! New York University (NYU) United States 10.54 25 1055 -0.1% 
So) University of California Los Angeles (UCLA) United States 10.35 4l IDS) -18.8% 
$8} The University of Texas Southwestern Medical Center (UT Southwestern Medical Center) United States 9.88 20 11.79 -16.2% 
34 The University of Chicago (UChicago) United States O33 47 E75) -20.6% 
35) F. Hoffmann-La Roche Ltd Switzerland 9.18 20 2.58 255.9% 
36 The Rockefeller University United States 9.14 20 11.45 -20.2% 
37 University of Pennsylvania (Penn) United States 9.01 30 10.23 -11.9% 
38 Lawrence Berkeley National Laboratory (LBNL) United States 9.00 52 5.37 50.7% 
39 University of California Santa Cruz (UCSC) United States 8.93 37 4.54 96.6% 
40 University of Toronto (U of T) Canada 8.64 38 15.70 -44.9% 
Al Duke University United States 8.55 23 7.14 19.8% 
42 University of Maryland, College Park (UMCP) United States 8.49 40 5.21 63.1% 
43 Kyoto University Japan 7TA4 25 748 -0.5% 
44 University of Copenhagen (UCPH) Denmark 743 50 pill 45.6% 
45 RIKEN Japan Ue 32 9.62 -23.8% 
46 University of California Davis (UC Davis) United States ws 29 8.06 -11.6% 
47 University College London (UCL) United Kingdom 7.10 41 169 -7.6% 
48 Pennsylvania State University (Penn State) United States 7.01 25 6.26 11.9% 
49 University of Zurich (UZH) Switzerland 6.70 19 4.81 39.1% 
50 Peking University (PKU) China 6.48 28 4.10 58.0% 
Weighted fractional count (WFC) for each institution is shown to two decimal places only. These results are based on the most recent data available as of 11 September 2014. 
When two or more institutions have the same WFC, their positions are determined by the Owing to continual refinements of the data, the figures in the database are liable to 
thousandth place (or beyond). change and might differ to those printed in the supplements. 
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